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=> fil reg 

FILE 'REGISTRY' ENTERED AT 10:57:07 ON 02 OCT 2003 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS, 

COPYRIGHT (C) 2003 American Chemical Society (ACS) 

Property values tagged with IC are from the ZIC/VINITI data file 
provided by InfoChem. 

STRUCTURE FILE UPDATES: 1 OCT 2003 HIGHEST RN 596788-60-2 

DICTIONARY FILE UPDATES: 1 OCT 2003 HIGHEST RN 596788-60-2 

TSCA INFORMATION NOW CURRENT THROUGH JULY 14, 2003 

Please note that search-term pricing does apply when 
conducting SmartSELECT searches. 

Crossover limits have been increased. See HELP CROSSOVER for details. 

Experimental and calculated property data are now available. See HELP 
PROPERTIES for more information. See STNote 27, Searching Properties 
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=> d que 11 

LI 2 SEA FILE=REGISTRY ABB=ON PLU=ON ELNFSTGW/ SQSP 



=> d 11 sqide 1-2 

Ll ANSWER 1 OF 2 REGISTRY COPYRIGHT 2 003 ACS on STN 
RN 57 7987-91-8 REGISTRY 

CN Glycine, L- . alpha . -glutamyl-L-leucyl-L-asparaginyl-L-phenylalanyl-L-seryl- 

L-threonylglycyl-L-tryptophylglycyl- (9CI) ' (CA INDEX NAME) 
OTHER NAMES: 

CN 36: PN: WO03066080 SEQID: 36 unclaimed sequence 
FS PROTEIN SEQUENCE; STEREOSEARCH 
SQL 10 

SEQ 1 ELNFSTGWGG 



HITS AT: 1-8 

MF C48 H66 N12 016 

SR CA 

LC STN Files: CA, CAPLUS, USPATFULL 
Absolute stereochemistry. 



/ Structure 1 in file . gra / 



**PROPERTY DATA AVAILABLE IN THE 'PROP' FORMAT** 

1 REFERENCES IN FILE CA (1907 TO DATE) 



1 REFERENCES IN FILE CAPLUS (1907 TO DATE) 

Ll ANSWER 2 OF 2 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 149924-80-1 REGISTRY 

CN Chromatophorotropin, red-pigment-concentrating (Pandalus borealis) 

l-L-glutamic acid-6-L-threonine-8-L-tryptophan- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 4: PN: WO03066080 SEQID: 4 unclaimed sequence 

CN Protein (Schistocerca gregaria adipokinetic hormone II) 

FS PROTEIN SEQUENCE; STEREOS EARCH 

SQL 8 

SEQ 1 ELNFSTGW 



HITS AT: 1-8 

MF C44 H60 NIO 014 

SR CA 

LC STN Files: CA, CAPLUS, USPATFULL 
Absolute stereochemistry. 



/ Structure 2 in file . gra / 

2 REFERENCES IN FILE CA (1907 TO DATE) 

2 REFERENCES IN FILE CAPLUS (1907 TO DATE) 



=> fil hcpalus 

'HCPALUS' IS NOT A VALID FILE NAME 
SESSION CONTINUES IN FILE 'REGISTRY* 

=> fil hcaplus 

FILE 'HCAPLUS' ENTERED AT 10:57:20 ON 02 OCT 2003 

USE IS SUBJECT TO THE TERMS OF YOUR STN CUSTOMER AGREEMENT. 

PLEASE SEE "HELP USAGETERMS" FOR DETAILS. 

COPYRIGHT (C) 2003 AMERICAN CHEMICAL SOCIETY (ACS) 



Copyright of the articles to which records in this database refer is 
held by the publishers listed in the PUBLISHER (PB) field (available 
for records published or updated in Chemical Abstracts after December 
26, 1996), unless otherwise indicated in the original publications. 
The CA Lexicon is the copyrighted intellectual property of the 
the American Chemical Society and is provided to assist you in searching 
databases on STN. Any dissemination, distribution, copying, or storing 
of this information, without the prior written consent of CAS, is 
strictly prohibited. 

FILE COVERS 1907 - 2 Oct 2003 VOL 139 ISS 14 
FILE LAST UPDATED: 1 Oct 2003 ( 2 0031001/ED) 

This file contains CAS Registry Numbers for easy and accurate 
substance identification. 
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LI 2 SEA FILE^REGISTRY ABB=ON PLU=ON ELNFSTGW/SQSP 

L2 2 SEA FILE=HCAPLUS ABB=ON PLU=ON LI 



=> d -ca 12 1-2 
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ANSWER 1 OF 2 HCAPLUS COPYRIGHT 2003 ACS on STN 



ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



INVENTOR (S) : 
PATENT ASSIGNEE(S): 
SOURCE: 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2003:633493 HCAPLUS 

139:173813 

Insect adipokinetic hormone peptide compositions and 
methods for promoting lipid mobilization, glycogen 
mobilization, or both, in humans, and screening 
methods 

Schacter, Bernice Z.; Schacter, Lee P. 

ELM Group, USA 

POT Int. Appl., 82 pp. 

CODEN: PIXXD2 

Patent 

English 
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IT 



MARPAT 139:173813 

The invention provides methods of using polypeptide compds . based on the 
structures of insect peptides of the adipokinetic hormone family in humans 
for a variety of purposes, including mobilization of lipids and glycogen, 
inhibition of protein, lipid, and RNA synthesis, and enhancement of muscle 
contractility. The compns . and methods are useful e.g. for modulating 
human body wt., inducing wt . loss, and alleviating glycogen storage 
disorders. The invention also includes screening methods for identifying 
other compds. having similar activities in humans - 
ICM A61K038-03 

ICS A61K038-10; C07K007'-06; C07K007-08; G01N033-68 
1-10 (Pharmacology) 

Section cross-reference (s ) : 12, 63 

22006-64-0, . alpha . 1-13-Corticotropin 123466-63-7, Melanin-concentrating 
hormone (human reduced) ***149924-80-l*** 577986-95-9 577986-98-2 

577987-01-0 577987-06-5 577987-09-8 577987-16-7 577987-19-0 
577987-22-5 577987-25-8 577987-28-1 577987-31-6 577987-36-1 
577987-39-4 577987-44-1 577987-47-4 577987-52-1 577987-55-4 



577987-58-7 577987-65-6 577987-68-9 577987-72-5 577987-75-8 
577987-80-5 577987-83-8 577987-8 6-1 ***577 987-91- 8*** 
577987-94-1 577987-99-6 577988-02-4 
RL: PRP (Properties) 

(unclaimed sequence; insect adipokinetic hormone peptide compns , and 
methods for promoting lipid mobilization, glycogen mobilization, or 
both, in humans, and screening methods) 
REFERENCE COUNT: 4 THERE ARE 4 CITED REFERENCES AVAILABLE FOR THIS 

RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 



L2 ANSWER 2 OF 2 HCAPLUS COPYRIGHT 2003 ACS on STN 
ACCESSION NUMBER: 1993:599933 HCAPLUS 

DOCUMENT NUMBER: 119:199933 

TITLE: AKH biosynthesis: transcriptional and translational 

control of two co-localized prohormones 

AUTHOR(S): Fis cher-Lougheed, Jacqueline; O'Shea, Michael; 

Cornish, Ian; Losberger, Christophe; Roulet, 
Emmanuelle; Schulz-Aellen, Marie Francoise 

CORPORATE SOURCE: Sch, Biol. Sci., Univ. Sussex, Brighton, BNl 9QG, UK 

SOURCE: Journal of Experimental Biology (1993), 177, 223-41 

CODEN: JEBIAM; ISSN: 0022-0949 

DOCUMENT TYPE: Journal 

LANGUAGE: English 

AB The neurosecretory cells of the locust corpora cardiaca (CC) express w 
co-localized transcripts which are translated into the 2 preprohormones 
required in adipokinetic hormone I (AKH I) and AKH II biosynthesis. At 
different stages of postembryonic development, the relative amts . of the 2 
transcripts (AKH I mRNA andAKH II mRNA) change in parallel with the 
relative rates of synthesis of proAKH I and proAKH II. Differential 
regulation of transcript expression, however, cannot account for the 
changes in neuropeptide ratios seen during postembryonic development. 
Comparison of in vivo and in vitro translation shows that protein 
synthesis in vivo is biased towards the translation of AKH I mRNA by a 
factor of .apprx.2.6. This factor appears to be const, during 
postembryonic development and is required to produce the obsd. 
developmental changes in neuropeptide ratios. Both transcriptional and 
translational mechanisms are therefore necessary to alter neuropeptide 
ratios in the CC. The mechanisms can account for the developmentally 
changing pattern of peptide expression. Regulation of neuropeptide ratios 
indicates that signalling functions can be attributed to the precise 
configuration of peptide cocktails - 

CC 12-3 (Nonmammalian Biochemistry) 
Section cross-reference ( s ) : 3, 6 

IT 130002-73-2, Protein ( Schistocerca gregaria prepro-adipokinetic hormone I) 
130093-13-9, Protein (Schistocerca gregaria prepro-adipokinetic hormone I 
C-terminal . alpha . -chain) ***149924-80-l*** , Protein (Schistocerca 
gregaria adipokinetic hormone II) 149924-81-2, Protein (Schistocerca 
gregaria adipokinetic hormone I) 150295-31-1, Protein (Schistocerca 
gregaria prepro-adipokinetic hormone II C-terminal . beta . -chain) 
150679-74-6 150679-75-7, Protein (Schistocerca gregaria pro-adipokinetic 
hormone II) 150679-76-8, Protein (Schistocerca gregaria pro-adipokinetic 
hormone I) 

RL: BIOL (Biological study) 

(amino acid sequence and mRNA for, in development) 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 2, 2003, 08:40:01 ; Search time 83 Seconds 

(without alignments) 
15.299 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-072-419~4 
47 

1 ELNFSTGW 8 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1107863 seqs, 158726573 residues 

Total number of hits satisfying chosen parameters: 1107863 

Minimum DB seq length: 0 ' 
Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 



Database : A_Geneseq_19 Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1980 , DAT : * 

2 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl981 .DAT: * 

• 3: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 .DAT: * 

4 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983 . DAT : * 

5: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1984 .DAT: * 

6: /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1985 .DAT: * 

7 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1986.DAT: * 

8 : /SIDSl/gcgdata/geneseq/geneseqp-embl/7\Al987 . DAT: * 

9: /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl988 .DAT: * 
10 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1989 . DAT : * 
11 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1990 . DAT : * 
12 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl991 . DAT : * 
13 : /SIDSl/gcgdata/geneseq/geneseqp-embl/7\A1992 . DAT : * 
14 : /SIDSl/gcgdata/geneseq/geneseqp~embl/AA1993 . DAT : * 
15 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 . DAT : * 
16 : /SIDSl/gcgdata/geneseq/geneseqp-embl/7^1995 . DAT : * 
17 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AAl996. DAT: * 
18 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 . DAT : * 
19 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 . DAT : * 
20 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA1999 . DAT : * 
21 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2000 . DAT : 
22 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2001 . DAT: * 
2-3: /SIDSl/gcgdata/geneseq/geneseqp~embl/AA2002 . DAT : * 
24 : /SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 . DAT : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed. 



and is derived by analysis of the total score distribution. 

SUMMARIES 
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Match 
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Description 
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80. 
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AAY31073 


Non-crosslinked pr 
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76. 
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21 
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76. 
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23 
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74. 
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22 


AAO01790 


Human polypeptide 
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Human matrix metal 
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AAY70007 
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Bacillus subtilis 
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AAY83900 


Human HS6ST1 prote 


84 


32 


68. 


1 


438 


23 


ABB89683 


Human polypeptide 


85 


' 32 


68. 


1 


459 


22 


AAB93310 


Human protein sequ 


86 


32 


68. 


1 


459 


23 


ABB06194 


Human HS6ST2v prot 


87 


32 


68. 


1 


459 


23 


AAE14449 


Human drug metabol 


88 


32 


68. 


1 


460 


24 


ABP71362 


Human AK polyp epti 


89 


32 


68. 


1 


4 62 


22 


AAB94993 


Human protein sequ 


90 


32 


68. 


1 


465 


22 


AAB88474 


Human membrane or 


91 


32 


68. 


1 


465 


24 


AAO26608 


Human 6-OST-2A pro 


92 


32 


68 . 


1 


469 


24 


ABP71364 


Mouse H6ST-3 prote 


93 


32 


68. 


1 


470 


21 


AAY8 39 02 


Mouse HS6ST2 prote 


94 


32 


68. 


1 


471 


24 


AAO26610 


Human 6-OST-3 prot 


95 


32 


68. 


1 


499 


23 


AAE21048 


Human drug metabol 


96 


32 


68. 


1 


499 


23 


ABB06195 


Human HS6ST2 prote 


97 


32 


68. 


1 


504 


24 


AAO26609 


Human 6-OST-2B pro 


98 


32 


68. 


1 


506 


21 


AAY83901 


Mouse HS6ST1 prote 


99 


32 


68. 


1 


506 


24 


ABP71363 


Mouse H6ST-2 prote 


100 


32 


68. 


1 


508 


23 


AAE21053 


Human drug metabol 



ALIGNMENTS 



RESULT 1 
AAY31073 



ID AAY31073 standard; peptide; 10 AA. 
XX 

AC AAY31073; 

XX 

DT 21-OCT-1999 (first entry) 
XX 

DE Non-crosslinked protein particle peptide 122. 
XX 

KW Non-crosslinked protein particle; diagnostic; therapy; monodisperse; 

PvW albumin; haemoglobin; nanometer; micrometer; clearance. 

XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Modified- site 1 

FT /note= "pyroglutamic acid" 

FT Modified-site 10 

FT /note=^ "C- terminal amide" 
XX 

PN US5945033~A. 

XX 

PD 31-AUG~1999. 
XX 

PF 12-NOV-1996; 96US-0747 137 . 
XX 

PR 14-MAR-1994; 94US-0212546 . 

PR 15-JAN-1991; 91US-0 64 172 0 . 

PR 13-OCT-1992; 92US-0 959560 . 

PR Ol-JUN-1993; 93US-0069831 . 

PR 12-NOV-1996; 96US-0747137 . 
XX 

PA (HEMO-) HEMOSPHERE INC. 
XX 

PI Yen RCK; 
XX 

DR WPI; 1999-508153/42. 

XX 

PT Non-crosslinked protein particles for therapeutic and diagnostic use 
XX 

PS Example 22; Column 103-104; 65pp; English. 

XX 

CC This invention describes a novel aqueous suspension of monodisperse 

CO particles on non-crosslinked, non-denatured albumin (50-5000 nm) which 

CC is stable against dissolving upon dilution with an alcohol-free aqueous 

CC medium. The method involves (a) forming an aqueous solution containing 

CC albumin and hemoglobin and (b) treating the aqueous solution with an 

CC alcohol to cause the solution to become turbid. The particles are useful 

CC as agents for in vivo administration, either of their own administration 

CC or as a vehicle for other therapeutic or diagnostic agents. The method 

CC permits the formation of albumin and hemoglobin particles in the 

CC nanometer and micrometer size range,.' in a form closer to their natural 

CC form than the forms of the prior art. The particles therefore constitute 

CC a more closely controlled agent for in vivo administration, with greater 

CC ease of clearance from the body after their period of usefulness. 

CC AAY30952-Y31135 represent peptides used in the method of the invention. 

XX 

SQ Sequence 10 AA; 



Query Match 80.9%; Score 38; DB 20; Length 10; 

Best Local Similarity 75.0%; Pred. No. 0.64; 

Matches 6; Conservative 1; Mismatches 1; Indels 0; Gaps 

Qy 1 ELNFSTGW 8 

I : I I I II 
Db 1 EVNFSPGW 8 



Search completed: October 2, 2003, 08:50:39 
Job time : 86 sees 

GenCore version 5.1,6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 2, 2003, 08:49:12 ; Search time 29 Seconds 

(without alignments) 
11-672 Million cell updates/sec 

US-10-072-419-4 
47 

1 ELNFSTGW 8 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 
328717 seqs, 42310858 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 



328717 



Database : Issued_Patents_AA: * 

1 : /cgn2__6/ptodata/l/iaa/5A_COMB. pep : * 

2: /cgn2_6/ptodata/l/iaa/5B_COMB.pep:* 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB . pep : 

6 : /cgn2_6/ptodata/l/iaa/backf ilesl .pep : * 

Pred. No. is the nuitiber of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



No. Score Match Length DB ID 



Description 



1 


35 


74 . 


5 


376 


4 


US- 09-564-954-2 


Sequence 


2, Appli 


2 


34 


72 . 


3 


339 


4 


US- 09-252-9 9 lA-2 9 08 5 


Sequence 


29085, A 


3 


33 


70 . 


2 


10 


2 


US- 08-747-137-122 


Sequence 


122, App 


4 


33 


70 . 


2 


300 , 


4 


US-09-091-501B-11 


Sequence 


11, Appl 


5 


33 


70 . 


2 


300 


4 


US- 09- 09 1-50 IB- 12 


Sequence 


12, Appl 


6 


33 


7 0 - 


2 


300 


4 


US- 09- 09 1-50 IB- 13 


Sequence 


13, Appl 


7 


33 


70 . 


2 


380 


4 


US-09-134-001C-3830 


Sequence 


3830, Ap 


8 


33 


70 . 


2 


2008 


4 


US- 09- 09 1-50 IB- 8 


Sequence 


8, Appli 


9 


33 


70 . 


2 


3433 


4 


US- 0 9- 09 1-50 IB- 10 


Sequence 


10, Appl 


10 


32 . 5 


69 . 


1 


990 


1 


US-08-232-540-2 


Sequence 


2, Appli 


11 


32 . 5 


69 . 


1 


990 


1 


US-08-428-949A-2 


Sequence 


2, Appli 


12 


32 , 5 


69- 


1 


990 


1 


US-08-428-94 8A-2 


Sequence 


2, Appli 


13 


32 . 5 


69 . 


1 


990 


2 


us- 08-42 8- 94 6-2 


Sequence 


2, Appli 


14 


32.5 


69 . 


1 


990 


5 


PCT-US95-04656-2 


Sequence 


2, Appli 


15 


32 . 5 


69 . 


1 


1013 


1 


US-08~233-008A-8 


Sequence 


8, Appli 


16 


32 


68 . 


1 


91 


3 


us- 0 8- 8 5 1-8 4 3A- 220 


Sequence 


220, App 


17 


32 


68 . 


1 


91 


3 


US-08-974-549A-339 


Sequence 


339, App 


18 


32 


68 . 


1 


91 


3 


US-0 8-85 4-050-2 20 


Sequence 


220, App 


19 


32 


68 . 


1 


91 


4 


US- 09-4 30-323-22 0 


Sequence 


220, App 


20 


32 


68 . 


1 


434 


4 


US- 0 9-2 52- 99 lA- 18 Oil 


Sequence 


18011, A 


21 


32 


68 . 


1 


820 


4 


US-09-2 52-991A-25454 


Sequence 


25454, A 


22 


31 


66. 


0 


241 


4 


US- 09-32 8-3 52- 5 020 


Sequence 


5020, Ap 


23 


31 


66 . 


0 


265 


4 


US- 09-7 4 2-693- 30 


Sequence 


30, Appl 


24 


31 


66 . 


0 


389 


4 


US- 0 9-32 8- 3 52-7 099 


Sequence 


7099, Ap 


25 


31 


66 . 


0 


535 


3 


US -08-813-574 -2 


Sequence 


2, Appli 


26 


31 


66 . 


0 


997 


4 


US- 0 9-2 52- 99 lA- 3 07 9 9 


Sequence 


30799, A 


27 


31 


66. 


0 


2710 


1 


US-08-48 0-604A-6 


Sequence 


6, Appli 


28 


31 


66. 


0 


2710 


2 


US-08-4 05-496A-6 


Sequence 


6, Appli 


29 


31 


66 , 


0 


2710 


3 


US -08-915-13 6- 6 


Sequence 


6, Appli 


30 


31 


66 . 


0 


2710 


4 


US-08-957-310-6 


Sequence 


6, Appli 


31 


31 


66 . 


0 


2710 


4 


US -10-01 1-36 6- 6 


Sequence 


6, Appli 


32 


31 


66 . 


0 


2864 


4 


US-08-469-260A-394 


Sequence 


394, App 


33 


31 


66 . 


0 


2864 


4 


US- 08-4 8 8-4 4 6-3 94 


Sequence 


394, App 


34 


31 


66 , 


0 


2864 


4 


US-08-467-344A-394 


Sequence 


394, App 


35 


30 


63 . 


8 


38 


2 


US-08-867-087B-44 


Sequence 


44, Appl 


36 


30 


63 . 


8 


133 


4 


US -09-072-59 6- 283 


Sequence 


2 83, App 


37 


30 


63 . 


8 


200 


4 


US-09-252-991A-26936 


Sequence 


26936, A 


38 


30 


63 . 


8 


211 


4 


US- 0 9-2 52- 9 9 1A~ 3 002 9 


Sequence 


30029, A 


39 


30 


63 , 


8 


307 


2 


US-0 8-7 82-7 60-6 


Sequence 


6, Appli 


40 


30 


63 . 


8 


307 


5 


PCT-US96-00995-6 


Sequence 


6, Appli 


41 


30 


63 . 


8 


396 


1 


US- 07-64 9-59 lB-4 


Sequence 


4, Appli 


42 


30 


63. 


8 


396 


1 


US-08-277-540-4 


Sequence 


4, Appli 


43 


30 


63. 


8 


396 


1 


US-08-430-787A-4 


Sequence 


4, Appli 


44 


30 


63 . 


8 


518 


4 


US-09-134-001C-4348 


Sequence 


4348, Ap 


45 


30 


53 . 


8 


607 


1 


US- 07- 9 5 9- 94 3-7 


Sequence 


7, Appli 


46 


30 


63. 


8 


607 


1 


US-07-879-617A-12 


Sequence 


12, Appl 


47 


30 


63. 


8 


607 


1 


US-08-753-985-12 


Sequence 


12, Appl 


48 


30 


63. 


8 


630 


1 


US-07-959-943-9 


Sequence 


9, Appli 


49 


30 


63. 


8 


630 


1 


US-07-959-943-11 


Sequence 


11, Appl 


50 


30 


63. 


8 


807 


4 


US-09-177-650-3 


Sequence 


3, Appli 


51 


30 


63. 


8 


866 


2 


US-08-483-101~4 


Sequence 


4, Appli 


52 


30 


63. 


8 


1711 


2 


US-08-342-930-2 


Sequence 


2, Appli 


53 


30 


63. 


8 


1881 


4 


US-09-233-086~3 


Sequence 


3, Appli 


54 


29 


61. 


7 


39 


2 


US-08-867-087B-42 


Sequence 


42, Appl 


55 


29 


61. 


7 


40 


2 


US-08-867-087B-43 


Sequence 


43, Appl 



56 


29 


61. 


7 


126 


4 


US- 


09- 


554- 


765-7 


Sequence 


1, Appli 


57 


29 


61. 


7 • 


147 


4 


US- 


09- 


554- 


765-10 


Sequence 


10, Appl 


58 


29 


61. 


7 


189 


1 


US- 


08- 


709- 


912-13 


Sequence 


13, Appl 


59 


29 


61. 


7 


189 


2 


US- 


09- 


047- 


370-13 


Sequence 


13, Appl 


60 


29 


61. 


7 


235 


4 


us- 


09- 


328- 


352-6859 


Sequence 


6859, Ap 


61 


29 


61. 


7 


250 


2 


us- 


08- 


867- 


087B-13 


Sequence 


13, Appl 


62 


29 


61. 


7 


261 


4 


us- 


09- 


252- 


991A-33140 


Sequence 


33140, A 


63 


29 


61. 


7 


263 


2 


us- 


08- 


206- 


790A-23 


Sequence 


23, Appl 


64 


29 


61. 


7 


263 


4 


us- 


09- 


311- 


784A-28 


Sequence 


28, Appl 


65 


29 


61. 


7 


263 


5 


PCT 


-US95-02943-23 


Sequence 


23, Appl 


66 


29 


61. 


7 


302 


4 


us~ 


09- 


585- 


858-54 


Sequence 


54, Appl 


67 


29 


61. 


7 


303 


3 


us- 


08- 


818- 


112-92 


Sequence 


92, Appl 


68 


29 


61. 


7 


303 


4 


us- 


08- 


818- 


111-93 


Sequence 


93, Appl 


69 


29 


61.7 


303 


4 


us- 


09- 


056- 


556-92 


Sequence 


92, Appl 


70 


29 


61. 


7 


303 


4 


us- 


09- 


072- 


596-93 


Sequence 


93, Appl 


71 


29 


61. 


7 


306 


1 


us- 


08- 


696- 


139-4 


Sequence 


4, Appli 


72 


29 


61. 


7 


310 


4 


us- 


09- 


328- 


352-5485 


Sequence 


54 85, Ap 


73 


29 


61. 


7 


312 


1 


us~ 


08- 


525- 


5 0 5 A- 4 


Sequence 


4, Appli 


74 


29 


61. 


7 


314 


1 


us- 


08- 


525- 


505A-2 


Sequence 


2, Appli 


75 


29 


61. 


7 


321 


4 


us- 


09- 


252- 


991A-26066 


Sequence 


26066, A 


76 


29 


61. 


7 


329 


4 


us- 


09- 


011- 


769A-51 


Sequence 


51, Appl 


77 


29 


61. 


7 


331 


4 


us- 


09- 


252- 


991A-28 430 


Sequence 


28430, A 


78 


29 


61. 


7 


346 


2 


us- 


07- 


952- 


853-24 


Sequence 


24, Appl 


79 


29 


61. 


7 


346 


2 


us- 


08- 


914- 


848-24 


Sequence 


24, Appl 


80 


29 


61. 


7 


349 


4 


us- 


09- 


011- 


769A-47 


Sequence 


47, Appl 


81 


29 


61. 


7 


349 


4 


us- 


09- 


011- 


7 69A-60 


Sequence 


60, Appl 


82 


29 


61. 


7 


349 


4 


us- 


09- 


011- 


7 69A-64 


Sequence 


64, Appl 


83 


29 


61. 


7 


367 


4 


us- 


09- 


328- 


352-7478 


Sequence 


7478, Ap 


84 


29 


61. 


7 


372 


4 


us- 


09- 


252- 


991A-18963 


Sequence 


18963, A 


85 


29 


61. 


7 


383* 


3 


us- 


09- 


357- 


251-26 


Sequence 


26, Appl 


86 


29 


61. 


7 


404 


1 


us- 


08- 


696- 


139-2 


Sequence 


2 , Appli 


87 


29 


61. 


7 


409 


4 


us- 


09- 


554- 


765-14 


Sequence 


14, Appl 


88 


29 


61. 


7 


415 


2 


us- 


08- 


860- 


882A-57 


Sequence 


57, Appl 


89 


29 


61.7 


415 


4 


us- 


09- 


011- 


769A-39 


Sequence 


39, Appl 


90 


29 


61.7 


424 


4 


us~ 


09- 


011- 


769A-56 


Sequence 


56, Appl 


91 


29 


61. 


7 


432 


3 


us- 


09- 


306- 


595C-8 


Sequence 


8, Appli 


92 


29 


61. 


7 


432 


4 


us- 


09- 


925- 


388-8 


Sequence 


D, Appli 


93 


29 


61. 


7 


477 


4 


us~ 


09- 


328- 


352-6549 


Sequence 


6549, Ap 


94 


29 


61. 


7 


489 


4 


us- 


09- 


328- 


352-5088 


Sequence 


5088, Ap 


95 


29 


61. 


7 


538 


4 


us- 


09- 


252- 


991A-23060 


Sequence 


23060, A 


96 


29 


61. 


7 


549 


4 


us- 


09- 


107- 


532A-7304 


Sequence 


7304, Ap 


97 


29 


61. 


7 


558 


4 


us- 


09- 


328- 


352-5987 


Sequence 


5987, Ap 


98 


29 


61. 


7 


580 


1 


us- 


08- 


309- 


512-6 


Sequence 


6, Appli 


99 


29 


61. 


7 


580 


5 


PCT 


-US92-08756A-6 


Sequence 


6, Appli 


100 


29 


61. 


7 


613 


3 


us- 


09- 


171- 


945-113 


Sequence 


113, App 



ALIGNMENTS 



RESULT 1 
US-09-564-954-2 

; Sequence 2, Application US/09564954 

; Patent No. 6406889 

; GENERAL INFORMATION: 

; APPLICANT: Bae, Weonhye 

; APPLICANT: Van Horn, Stephanie 



APPLICANT: Warren, Richard L. 
APPLICANT: Biswas^ San joy 
APPLICANT: Throup, John P. 
APPLICANT: Burnham, Martin K. R. 
TITLE OF INVENTION: 509HK 
FILE REFERENCE: GM10220HK 

CURRENT APPLICATION NUMBER: US/09/564,954 
CURRENT FILING DATE: 2000-05-04 
PRIOR APPLICATION NUMBER: US 60/132,935 
PRIOR FILING DATE: 1999-05-06- 

NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 4.( 
SEQ ID NO 2 
LENGTH: 376 
TYPE: PRT 

ORGANISM: Staphylococcus aureus 
US-09-564-954-2 



Query Match 74.5%; 
Best Local Similarity 83.3%; 
Matches 5; Conservative 



Score 35; DB 4; Length 376; 
Pred. No. 43; 
1; Mismatches 0; Indels 



Qy 

Db 



3 NFSTGW 8 

I I : I I I 
45 NFTTGW 50 



RESULT 3 

US-08-747-137-122 

; Sequence 122, Application US/08747137 

; Patent No. 5945033 

; GENERAL* INFORMATION: 

; APPLICANT: YEN, Richard C.K. 

TITLE OF INVENTION: NON-CROSSLINKED PROTEIN PARTICLES FOR 

; TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC USE 

; NUMBER OF SEQUENCES: 184 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcadero Center, 8th Floor 

; CITY: San Francisco 

; STATE: CA 

; COUNTRY: USA 

; ZIP: 94111 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: PatentIn Release #1.0, Version #1.30 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/747,137 

; FILING DATE: 12-NOV-1996 

; CLASSIFICATION: 424 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 08/212,546 

; FILING DATE: 14-MAR-1994 

; PRIOR APPLICATION DATA: 



; APPLICATION NUMBER: US 08/069,831 

; FILING DATE: Ol-JUN-1993 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 07/959,560 

; FILING DATE: 13-OCT-1992 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 07/641,720 

; FILING DATE: 15-JAN-1991 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Apple, Randolph T. 

; REGISTRATION NUMBER: 36,429 

; REFERENCE/DOCKET NUMBER: 016197-000840US 

; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 415-576-0200 

; INFORMATION FOR SEQ ID NO: 122: 

SEQUENCE CHARACTERISTICS: 

; LENGTH: 10 amino acids 

; TYPE: amino acid 

; STRANDEDNESS : not relevant 

; TOPOLOGY: not relevant 

FEATURE : 

; NAME/KEY: Modif ied-site 

; LOCATION: 1 

OTHER INFORMATION: /product= "p-Glu" 

; FEATURE : 

; NAME/KEY: Modif ied-site 

; LOCATION: 10 

; OTHER INFORMATION: /product= "Thr-Amide" 
US-08-747-137-122 

Query Match 70.2%; Score 33; DB 2; Length 10; 

Best Local Similarity 71.4%; Pred. No. 2.4; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 
Qy 2 LNFSTGW 8 



Db 



- I I I II 

2 VNFSPGW 8 



Search completed: October 2, 2003, 08:54:13 
Job time : 32 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein 



protein search, using sw model 



Run on : 



October 2, 



2003, 



08:47:57 



; Search time 39 Seconds 

(without alignments) 

19.727 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-072-419-4 
47 

1 ELNFSTGW 8 



Scoring table: 



BLOSUM62 



Gapop 10,. 0 f Gapext 0 . 5 



Searched: 283308 seqs, 96168682 residues 

Total number of hits satisfying chosen parameters: 283308 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 



Database : 



PIR_76:* 
pirl:^ 
pir2 :^ 
pirS : 
pir4 : ^ 



Fred, No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB 



ID 



Description 



1 


44 


93.6 


61 


1 


AKLQS2 


adipokinetic hormo 


2 


44 


93.6 


61 


1 


B34913 


adipokinetic hormo 


3 


41 


87.2 


8 


2 


A28004 


adipokinetic hormo 


4 


39 


83.0 


61 


1 


AKLQL2 


adipokinetic hormo 


5 


38 


80. 9 


8 


2 


A61348 


red pigment-concen 


6 


38 


80. 9 


110 


1 


JN0761 


red pigment-concen 


7 


38 


80.9 


282 


2 


T29279 


hypothetical prote 


8 


38 


80. 9 


371 


2 


T24853 


hypothetical prote 


9 


36 


76. 6 


315 


2 


T25171 


hypothetical prote 


10 


36 


76.6 


331 


2 


T28948 


hypothetical prote 


11 


36 


76.6 


341 


2 


T21400 


hypothetical prote 


12 


36 


76. 6 


346 


2 


T24567 


hypothetical prote 


13 


36 


76.6 


346 


2 


T24568 


hypothetical prote 


14 


35 


74.5 


10 


2 


A60421 


hypertrehalosemic 


15 


35 


74.5 


10 


2 


S08997 


hypertrehalosemic 


16 


35 


74 . 5 


10 


2 


S08998 


hypertrehalosemic 


17 


35 


74 . 5 


10 


2 


A26381 


hypertrehalosemic 


18 


35 


74.5 


301 


2 


A88931 


protein R11G11.15 


19 


35 


74.5 


336 


2 


T34162 


hypothetical prote 


20 


35 


74.5 


349 


2 


S74499 


phosphoglycerate m 


21 


35 


74.5 


370 


2 


H89971 


hypothetical prote 


22 


35 


74 . 5 


747 


2 


H82943 


hypothetical prote 


23 


35 


74.5 


915 


2 


T21773 


hypothetical prote 


24 


35 


74.5 


927 


2 


T21772 


hypothetical prote 


25 


34 


72.3 


10 


2 


A31571 


hypertrehalosemic/ 


26 


34 


72.3 


42 


2 


C39853 


alkanal monooxygen 


27 


34 


72 . 3 


108 


2 


T29281 


hypothetical prote 


28 


34 


72.3 


287 


2 


T29280 


hypothetical prote 


29 


34 


72.3 


339 


2 


A83539 


hypothetical prote 



30 


34 


72 . 3 


346 


1 


JH0387 


alkanal monooxygen 


31 


34 


72 . 3 


354 


2 


JQ0413 


alkanal monooxygen 


32 


34 


72 . 3 


412 


2 


T40295 


fructosyl amine - 


33 


34 


72.3 


465 


2 


T16618 


hypothetical prote 


34 


34 


72 . 3 


1090 


2 


C86450 


F5D14.27 protein - 


35 


33 


70.2 


168 


2 


T17790 


hypothetical prote 


36 


33 


70,2 


276 


2 


T50896 


hypothetical membr 


37 


33 


70,2 


294 


2 


JQ0798 


nucleoside-specif i 


38 


33 


70.2 


294 


2 


H90686 


nucleoside-specif i 


39 


33 


70,2 


294 


2 


D85537 


hypothetical prote 


40 


33 


7 0,2 


396 


2 


AH3375 


serine-type D-Ala- 


41 


33 


70.2 


420 


2 


T06000 


aspartic proteinas 


42 


33 


7 0.2 


438 


2 


T04800 


CDP-diacylglycerol 


43 


33 


70.2 


482 


2 


S42535 


alanine transamina 


44 


33 


70.2 


521 


2 


T11710 


probable amino aci 


45 


33 


70.2 


522 


2 


T39281 


ars binding protei 


46 


33 


70.2 


538 


2 


S657 64 


chitinase {EC 3.2. 


47 


33 


70.2 


559 


2 


C72732 


probable medium- ch 


48 


33 


70.2 


591 


1 


SYBYKT 


lysine-tRNA ligase 


49 


33 


70.2 


627 


2 


S67257 


proline transport 


50 


33 


7 0.2 


682 


2 


F81332 


probable periplasm 


51 


33 


70.2 


3433 


1 


S28381 


utrophin - human 


52 


32 


68 , 1 


8 


2 


A4 3 9 7 6 


hypertrehalosemic 


53 


32 


68 . 1 


8 


2 


B43976 


hypertrehalosemic 


54 


32 


68 . 1 


8 


2 


S55310 


adipokinetic hormo 


55 


32 


68 . 1 


8 


2 


A58620 


adipokinetic hormo 


56 


32 


68 . 1 


8 


2 


A05169 


neuropeptide M-I - 


57 


32 


68 . 1 


82 


2 


E82754 


hypothetical prote 


58 


32 


68 , 1 


110 


2 


S26167 


capsid assembly pr 


59 


32 


68 . 1 


115 


2 


T36887 


hypothetical prote 


60 


32 


68 . 1 


214 


2 


T11966 


lipoate biosynthes 


61 


32 


68 . 1 


339 


2 


AC2 153 


hypothetical prote 


62 


32 


68 . 1 


364 


2 


A82 8 60 


DNA replication an 


63 


32 


68 . 1 


366 


. 2 


T21166 


hypothetical prote 


64 


32 


68 . 1 


374 


2 


G88 955 


protein K04F1.6 [i 


65 


32 


68 . 1 


402 


2 


C97132 


hypothetical prote 


66 


32 


68 . 1 


429 


2 


AI2611 


Sun protein [impor 


67 


32 


68 . 1 


429 


2 


G97393 


sun-family protein 


68 


32 


68 . 1 


430 


2 


H83178 


conserved hypothet 


69 


32 


68 . 1 


445 


2 


C69233 


lysyl endopeptidas 


70 


32 


68 , 1 


446 


2 


H90507 


amino acid transpo 


71 


32 


68 . 1 


477 


2 


F82200 


cytochrome- c oxida 


72 


32 


68 . 1 


551 


2 


H95301 


FixN3 cytochrome c 


73 


32 


68 . 1 


631 


2 


T33559 


hypothetical prote 


74 


32 


68 . 1 


660 


2 


D91176 


heme utilization/ t 


75 


32 


68 . 1 


660 


2 


E86022 


outer membrane hem 


76 


32 


68 . 1 


714 


2 


T32386 


hypothetical prote 


77 


32 


68 . 1 


785 


2 


T00474 


hypothetical prote 


78 


32 


68 . 1 


813 


2 


A40601 


f erripyoverdine re 


79 


32 


68 . 1 


815 


2 


H83345 


f erripyoverdine re 


80 


32 


68. 1 


868 


2 


T25716 


hypothetical prote 


81 


32 


68. 1 


1024 


2 


T30868 


RhoA-binding prote 


82 


32 


68.1 


1117 


2 


C91286 


hypothetical prote 


83 


32 


68. 1 


1117 


2 


G86127 


hypothetical prote 


84 


32 


68. 1 


1121 


2 


T25715 


hypothetical prote 


85 


32 


68. 1 


1418 


2 


D75281 


ribonucleoside-dip 


86 


32 


68. 1 


1475 


2 


A60026 


cell communication 



87 


32 


68 . 


1 


2123 


2 


S55089 


probable acetyl— Co 


88 


32 


68 , 


1 


2326 


2 


m o o 1 yi n 

Tii y 14 u 


hypothetical prote 


89 


32 


68 . 


1 


2489 


2 


S59782 


probable membrane 


90 


31 


66. 


0 


8 


2 


S21663 


neuropeptide - flo 


91 


31 


66. 


0 


104 


2 


H69261 


hypothetical prote 


92 


31 


66 . 


0 


171 




by / U / 4 


probable acetyltra 


93 


31 


66. 


0 


176 


2 


E95274 


hypothetical prote 


94 


31 


66. 


0 


190 


2 


B84989 


SOS ribosomal prot 


95 


31 


66. 


0 


211 


2 


S43214 


phosphoglycerate m 


96 


31 


66. 


0 


229 


2 


AE1350 


phosphoglyceromuta 


97 


31 


66. 


0 


229 


2 


AH1720 


phosphoglyceromuta 


98 


31 


66. 


0 


287 


2 


AD1388 


conserved hypothet 


99 


31 


66. 


0 


287 


2 


AF1763 


conserved hypothet 


100 


31 


66. 


0 


315 


2 


E83064 


hypothetical prote 



ALIGNMENTS 



RESULT 1 

AKLQS2 

adipokinetic hormone II precursor - desert locust 

N;Alternate names: adipokinetic hormone II precursor-related peptide; 
adipokinetic hormone precursor-related peptide beta chain; APRP beta chain; Scg~ 
AKH-II 

N; Contains: adipokinetic hormone II; adipokinetic hormone-associated peptide II 
C; Species: Schistocerca gregaria (desert locust) 

C;Date: 31-Mar-1988 #sequence_revision 14-Nov-1997 #text_change 20-Mar-1998 
C;Accession: A58662; A44822; A25204; A24241 

R;Fischer-Lougheed, J.; O'Shea, M. ; Cornish, I.; Losberger, C; Roulet, E. ; 

Schulz-Aellen, M. F. 

J. Exp. Biol. 177, 223-241, 1993 

A; Title: AKH biosynthesis: transcriptional and translational control of two co- 
localised prohormones . 
A; Reference number: A58662 
A;Accession: A58662 
A; Molecule type: mRNA 
A; Residues: 1-61 <FIS> 

R;Hekimi, S.; Fischer-Lougheed, J.; O'Shea, M, 
J. Neurosci. 11, 3246-3256, 1991 

A; Title: Regulation of neuropeptide stoichiometry in neurosecretory cells. 
A; Reference number: A44822; MUID : 92044772 ; PMID: 1941082 

A;Accession: A44822 

A;Molecule type: protein 

A;Residues: ' E \ 24-45 T ', 47-61 <HEK> 

A; Experimental source: neurosecretory cells, corpora cardiaca 
A;Note: sequence extracted from NCBI backbone (NCBIP : 63239 ) 
R;Siegert, K. ; Morgan, P.; Mordue, W. 
Biol. Chem. Hoppe-Seyler 366, 723-727, 1985 

A; Title: Primary structures of locust adipokinetic hormones II. 
A; Reference number: A90692; MUID: 86050918 ; PMID: 4063072 
A;Accession: A25204 
A;Molecule type: protein 
A; Residues: 23-30 <SIE> 

R;Gade, G. ; Goldsworthy, G.J.; Schaffer, M.H.; Cook, J-C; Rinehart Jr./ K.L. 
Biochem. Biophys. Res. Commun. 134, 723-730, 1986 



A; Title: Sequence analyses of adipokinetic hormones II from corpora cardiaca of 
Schistocerca nitans^ Schistocerca gregaria,, and Locusta migratoria by fast atom 
bombardment mass spectrometry. 

A; Reference number: A24241; MUID: 86130555; PMID: 3947348 
A;Accession: A24241 
A;Molecule type: protein 
A; Residues: 'E', 24-30 <GAD> 

C; Complex: forms disulfide linked homodimers; disulfide linked heterodimers form 
between adipokinetic hormone-associated peptides I and II 
C; Function: 

A; Description: hormone, released from the corpora cardiaca after the beginning 
of flight, causes release of diglycerides from the fat body and then stimulates 
the flight muscles to use diglycerides as an energy source 
C; Super family : adipokinetic hormone 

C;Keywords: amidated carboxyl end; corpora cardiaca; heterodimer; homodimer; 

hormone; neuropeptide; pyroglutamic acid 

F; 1-22/Domain: signal sequence #status predicted <SIG> 

F;23-30/Product : adipokinetic hormone II #status experimental <MAT1> 

F; 34-61/Product : adipokinetic hormone-associated peptide II #status experimental 

<MAT2> 

F;23/Modified site: pyrrolidone carboxylic acid (Gin) (in mature form) #status 

experimental 

F; 30/Modif led site: amidated carboxyl end (Trp) (amide in mature form from 
following glycine) #status experimental 

F; 59/Disulf ide bonds: interchain (partial) #status experimental 

F; 59/Disulf ide bonds: interchain (to adipokinetic hormone-associated peptide I 
59) (partial) #status experimental 

Query Match 93.6%; Score 44; DB 1; Length 61; 

Best Local Similarity 87.5%; Pred. No, 0,095; 

Matches 7; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ELNFSTGW 8 

: 1 I I I I I I 
Db 23 QLNFSTGW 30 



RESULT 3 
A28004 

adipokinetic hormone G - two-spotted cricket 
N;Alternate names: AKH-G 

C; Species: Gryllus bimaculatus (two-spotted cricket) 

C;Date: 30-Jun-1989 #sequence_revision 24-Oct-1997 #text_change 24-Oct-1997 
C; Accession: A280 04 
R;Gaede, G. ; Rinehart, K.L. 

Biochem. Biophys . Res. Commun. 149, 908-914, 1987 

A; Title: Primary sequence analysis by fast atom bombardment mass spectrometry of 
a peptide with adipokinetic activity from the corpora cardiaca of the cricket 

Gryllus bimaculatus . 

A;Reference number: A28004; MUID: 88106553; PMID:3426616 
A;Accession: A28004 
A;Molecule type: protein 
A; Residues: 1-8 <GAE> 

A; Note: the amino-terminal residue forms pyrrolidone carboxylic acid; therefore, 

we have shown it as Gin 

C; Superfamily : adipokinetic hormone 



C; Keywords: amidated carboxyl end; corpora cardiaca; hormone; neuropeptide; 
pyroglutamic acid 

F;l/Modified site: pyrrolidone carboxylic acid (Gin) #status experimental 
F;8/Modified site; amidated carboxyl end (Trp) #status experimental 

Query Match 87.2%; Score 41; DB 2 ; Length 8; 

Best Local Similarity 75.0%; Pred. No. 2.8e+05; 

Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 ELNFSTGW 8 

:: I 1 I I 1 1 
Db 1 QVNFSTGW 8 

Search completed: October 2, 2003, 08:53:39 
Job time : 4 4 sees 

GenCore version 5.1.6 
Copyright (c) 1993 ~ 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 2, 2003, 08:40:31 ; Search time 23 Seconds 

(without alignments) 
16.357 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-072-419-4 
47 

1 ELNFSTGW 8 



Scoring table: BLOSUM62 

Gapop 10-0 , Gapext 0.5 



Searched: 



127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 



127863 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 

Database : SwissProt_4 1 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 44 93.6 39 1 AKH2 SCHGR 



P35808 schistocerc 



2 


44 


93. 


6 


61 


1 


AKH2 SCHNI 


P35807 


schis tocerc 


3 


41 


87. 


2 


8 


1 


AKHG_GRYBI 


P14086 


gryllus bim 


4 


39 


83. 


0 


61 


1 


AKH2_L0CMI 


P08379 


locusta mig 


5 


38 


80. 


9 


8 


1 


RPCH_PANBO 


P08939 


pandalus bo 


6 


38 


80. 


9 


109 


1 


RPCH__CALSI 


Q23757 


callinectes 


7 


38 


80. 


9 


110 


1 


RPCH_CARMA 


Q26324 


carcinus ma 


8 


35 


74. 


5 


10 


1 


HTF_NAUCI 


P10939 


nauphoeta c 


9 


35 


74. 


5 


72 


1 


HTF_BLADI 


Q17128 


blaberus di 


10 


35 


74. 


5 


747 


1 


Y030_UREPA 


Q9prb5 


ureaplasma 


11 


34 


72. 


3 


10 


1 


HTF_HELZE 


P16353 


heliothis z 


12 


34 


72. 


3 


346 


1 


LUXA_PHOPO 


P24113 


photobacter 


13 


34 


72. 


3 


354 


1 


LXA1_PH0LE 


P09140 


photobacter 


14 


33 


70. 


2 


214 


1 


IiAZA__SCHAM 


P49291 


schistocerc 


15 


33 


70. 


2 


294 


1 


TSX_ECOLI 


P22786 


escherichia 


16 


33 


70. 


2 


482 


1 


ALA2_H0RVU 


P52894 


hordeum vul 


17 


33 


70. 


2 


521 


1 


YCV4 SCHPO 


074543 


schizosacch 


18 


33 


70. 


2 


522 


1 


ABP1_SCHP0 


P49777 


schizosacch 


19 


33 


70. 


2 


590 


1 


SYKC_YEAST 


P15180 


saccharomyc 


20 


33 


70. 


2 


627 


1 


PUT4__YEAST 


P15380 


saccharomyc 


21 


33 


70. 


2 


3433 


1 


UTRO_HUMAN 


P46939 


homo sapien 


22 


32 


68 . 


1 


8 


1 


HTF TENMO 


P25419 


tenebrio mo 


23 


32 


68. 


1 


110 


1 


Y12M_BPT4 


Q00305 


bacteriopha 


24 


32 


68 . 


1 


214 


1 


LIPB_CYACA 


019898 


cyanidium c 


25 


32 


68. 


1 


232 


1 


GPMA__BUCBP 


Q89aj4 


buchnera ap 


26 


32 


68. 


1 


364 


1 


RECF_XYLFA 


Q9phel 


xylella fas 


27 


32 


68 . 


1 


593 


1 


TARA HUMAN 


Q9h2d6 


homo sapien 


28 


32 


68 . 


1 


815 


1 


FPVA_PSEAE 


P48632 


pseudomonas 


29 


32 


68 . 


1 


1024 


1 


RIP3 MOUSE 


P97434 


mus musculu 


30 


32 


68 . 


1 


1029 


1 


RIP3_RAT 


Q9ere6 


rattus norv 


31 


32 


68, 


1 


1475 


1 


TRA2_CAEEL 


P34709 


caenorhabdi 


32 


32 


68 . 


1 


2273 


1 


HFA1_YEAST 


P32874 


saccharomyc 


33 


31 


66, 


0 


104 


1 


Y096_ARCFU 


030140 


archaeoglob 


34 


31 


66. 


0 


178 


1 


RL6 BUCAI 


P57576 


buchnera ap 


35 


31 


66. 


0 


211 


1 


PMGY_SCHPO 


P36623 


schizosacch 


36 


31 


66. 


0 


229 


1 


GPMA_LISIN 


Q929g8 


listeria in 


37 


31 


66. 


0 


229 


1 


GPMA_LISMO 


Q8y571 


listeria mo 


38 


31 


66. 


0 


321 


1 


ABNA_ASPNG 


P42256 


aspergillus 


39 


31 


66. 


0 


338 


1 


YP7 9_CAEEL 


Q09439 


caenorhabdi 


40 


31 


66. 


0 


463 


1 


VNFK_ANAVA 


Q57302 


anabaena va 


41 


31 


66. 


0 


494 


1 


G6PD_ACTAC 


P77809 


actinobacil 


42 


31 


66. 


0 


494 


1 


G6PD_HAEIN 


P44311 


haemophilus 


43 


31 


66. 


0 


512 


1 


EMRY_ECOLI 


P52600 


escherichia 


44 


31 


66. 


0 


513 


1 


PDI HORVU 


P80284 


hordeum vul 


45 


31 


66. 


0 


513 


1 


PDI MAIZE 


P52588 


zea mays (m 


46 


31 


66. 


0 


515 


1 


PDI WHEAT 


P52589 


triticum ae 


47 


31 


66. 


0 


517 


1 


GUNA CLOLO 


P54937 


Clostridium 


48 


31 


66. 


0 


676 


1 


HMUR YERPE 


Q56989 


yersinia pe 


49 


31 


66. 


0 


, 677 


1 


VGP_EBORE 


Q66799 


ebola virus 


50 


31 


66. 


0 


677 


1 


VGP EBORS 


Q89853 


ebola virus 


51 


31 


66. 


0 


900 


1 


SYA_SULSO 


P96041 


sulf olobus 


52 


31 


66. 


0 


958 


1 


HIG DROME 


Q09101 


drosophila 


53 


31 


66. 


0 


1699 


1 


POLN LORDV 


P54634 


lordsdale v 


54 


31 


66. 


0 


1704 


1 


CED7_CAEEL 


P34358 


caenorhabdi 


55 


31 


66. 


0 


2280 


1 


COAC SCHPO 


P78820 


schizosacch 


56 


31 


66. 


0 


2710 


1 


TOXA__CLODI 


P16154 


Clostridium 


57 


31 


66. 


0 


3680 


1 


DMD_CANFA 


097592 


canis famil 


58 


31 


66.0 


3685 


1 


DMD HUMAN 


P11532 


homo sapien 



59 


30 


63. 


8 


54 


1 


ATP8 ASTPE 


Q33822 


asterina pe 


60 


30 


63 . 


8 


108 


1 


YGGL__ECOLI 


P38521 


escherichia 


61 


30 


63. 


8 


121 


1 


RNPA_COXBU 


P45648 


coxiella bu 


62 


30 


63. 


8 


157 


1 


MENG__MyCTU 


P96224 


mycobacteri 


63 


30 


63. 


8 


192 


1 


SODF BORPE 


P37369 


bordetella 


64 


30 


63 . 


8 


192 


1 


SODF_ECOLI 


P09157 


escherichia 


65 


30 


63 . 


8 


192 


1 


SODF SALTY 


P40726 


salmonella 


66 


30 


63. 


8 


213 


1 


YE54_HAEIN 


P44202 


haemophilus 


67 


30 


63. 


8 


234 


1 


GU33_RAT 


P35896 


rattus norv 


68 


30 


63. 


8 


248 


1 


PAAC_ECOLI 


P76079 


escherichia 


69 


30 


63 . 


8 


249 


1 


GPMA MYCTU 


Q1114 0 


mycobacteri 


70 


30 


63 . 


8 


251 


1 


GPMA TREPA 


P9612 1 


treponema p 


71 


30 


63 , 


8 


252 


1 


AGL6 ARATH 


P29386 


arabidopsis 


72 


30 


63. 


8 


290 


1 


YPHB___ECOLI 


P76584 


escherichia 


73 


30 


63 . 


8 


312 


1 


GU2 7 RAT 


P34987 


rattus norv 


74 


30 


63 . 


8 


401 


1 


CBPB PIG 


P09955 


sus scrofa 


75 


30 


63 . 


8 


415 


1 


CBPB RAT 


P19223 


rattus norv 


76 


30 


63 . 


8 


460 


1 


YS85 MYCTU 


Q10809 


mycobacteri 


77 


30 


63. 


8 


467 


1 


P0F9 SCHPO 


074381 


schizosacch 


78 


30 


63 . 


8 


476 


1 


OSTA YEAST 


P41543 


saccharomyc 


79 


30 


63 . 


8 


477 


1 


YZ64 SYNY3 


P73436 


synechocys t 


80 


30 


63 . 


8 


481 


1 


SYP CLOST 


Q914 q8 


cl o s t ri di um 


81 


30 


63 . 


8 


511 


1 


DHAY_YEAST 


P32872 


saccharomyc 


82 


30 


63. 


8 


630 


1 


S6A4_BOVIN 


Q9xt49 


bos taurus 


83 


30 


63. 


8 


630 


1 


S6A4_CAVPO 


035899 


cavia porce 


84 


30 


63. 


8 


630 


1 


S 6A4 HUMAN 


P31645 


homo sapien 


85 


30 


63 . 


8 


630 


1 


S6A4 MACMU 


Q9myx0 


macaca mula 


86 


30 


63. 


8 


630 


1 


S6A4_MOUSE 


Q60857 


mus musculu 


87 


30 


63. 


8 


630 


1 


S6A4__RAT 


P31652 


rattus norv 


88 


30 


63 . 


8 


636 


1 


FET3_ YEAST 


P38993 


saccharomyc 


89 


30 


63. 


8 


670 


1 


SSM4_SCHP0 


042667 


schizosacch 


90 


30 


63 . 


8 ' 


1336 


1 


MAMl SCHPO 


P78966 


schizosacch 


91 


30 


63 . 


8 


1441 


1 


VGLM__BUNSH 


P04875 


bunya virus 


92 


30 


63 . 


8 


145 5 


1 


MANR HUMAN 


n O O O Q T 


homo sapien 


93 


30 


63. 


8 


1658 


1 


ITN2 MOUSE 


Q9z0r6 


mus musculu 


94 


30 


63. 


8 


1711 


1 


PTPO RAT 


Q64612 


rattus norv 


95 


30 


63. 


8 


1927 


1 


LPH HUMAN 


P09848 


homo sapien 


96 


30 


63. 


8 


2493 


1 


YBA4 YEAST 


P35194 


saccharomyc 


97 


30 


63. 


8 


3343 


1 


Y0G7 CAEEL 


P34616 


caenorhabdi 


98 


30 


63. 


8 


4590 


1 


FATH HUM7VN 


Q14517 


homo sapien 


99 


29 


61. 


7 


8 


1 


AKH TABAT 


P14595 


tabanus atr 


100 


29 


61, 


7 


8 


1 


HTFl PERAM 


P04548 


periplaneta 



ALIGNMENTS 



RESULT 1 
AKH2_SCHGR 

ID AKH2_SCHGR STANDARD; PRT; 39 AA. 

AC P35808; P08378; 

DT Ol-AUG-1988 (Rel. 08, Created) 

DT Ol-JUN-1994 (Rel. 29, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Adipokinetic hormone II precursor [Contains : Adipokinetic hormone II 
DE (AKH-II) ; Adipokinetic hormone precursor-related peptide beta chain 
DE (APRP-beta) ] . 



OS Schistocerca gregaria (Desert locust) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Orthopteroidea; Orthoptera; Caelifera; Acridomorpha; 

OC Acridoidea; Acrididae; Cyrtacanthacridinae; Schistocerca. 

OX NCBI_TaxID=7 0lO; 

RN [1] 

RP SEQUENCE, AND PROCESSING. 

RX MEDLINE=92044772; PubMed=1941082 ; 

RA Hekimi S., Fischer-Lougheed J., O'Shea M. ; 

RT "Regulation of neuropeptide stoichiometry in neurosecretory cells." 

RL J. Neurosci. 11:3246-3256(1991). 

RN [2] 

RP SEQUENCE OF 1-8 . 

RX MEDLINE=86050918; PubMed=4 063072 ; 

RA Siegert K., Morgan P.,- Mordue W,; 

RT "Primary structures of locust adipokinetic hormones II."; 

RL Biol. Chem. Hoppe-Seyler 366:72 3-727(198 5). 

CC FUNCTION; THIS HORMONE, RELEASED FROM CELLS IN THE CORPORA 

CC CARDIACA AFTER THE BEGINNING OF FLIGHT, CAUSES RELEASE OF 

CC DIGLYCERIDES FROM THE FAT BODY AND THEN STIMULATES THE FLIGHT 

CC MUSCLES TO USE THESE DIGLYCERIDES AS AN ENERGY SOURCE. 

CC -!- SUBUNIT: ADIPOKINETIC HORMONE PRECURSOR- RELATED PEPTIDE (APRP) 

CC CAN FORM THREE TYPE OF DI SULFIDE-BOND DIMERS : Pi (ALPHA-ALPHA), 

CC P2 (ALPHA-BETA), AND P3 ( BETA- BETA ) . 

CC -!- SIMILARITY: BELONGS TO THE AKH / HRTH / RPCH FAMILY. 

DR InterPro; IPR002047; AKH. 

DR PROSITE; PS00256; AKH; 1. 

KW Neuropeptide; Amidation; Flight; Cleavage on pair of basic residues 

KW Pyrrolidone carboxylic acid. 

FT PEPTIDE 1 8 ADIPOKINETIC HORMONE II. 

FT PEPTIDE 12 39 ADIPOKINETIC HORMONE PRECURSOR- RELATED 

FT PEPTIDE BETA CHAIN. 

FT MOD__RES 1 1 PYRROLIDONE CARBOXYLIC ACID. 

FT MOD_RES 8 8 AMIDATION (G-9 PROVIDE AMIDE GROUP) . 

FT DISULFID 37 37 INTERCHAIN, 

SQ SEQUENCE 39 AA; 4371 MW; DB80C45CE57310C3 CRC64 ; 

Query Match 93.6%; Score 44; DB 1; Length 39; 

Best Local Similarity . 87.5%; Pred. No. 0.042; 

Matches 7; Conservative 1; Mismatches 0; Indels 0; Gap 



Qy 1 ELNFSTGW 8 

: I I M I I I 
Db 1 QLNFSTGW 8 



RESULT 3 
AKHG_GRYBI 

ID AKHG_GRYBI STANDARD; PRT; 8 AA. 

AC P14086; 

DT Ol-JAN-1990 (Rel. 13, Created) 

DT Ol-FEB-1994 (Rel. 28, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Adipokinetic hormone G (AKH-G) (RO II) . 

OS Gryllus bimaculatus (Two-spotted cricket), and 

OS Romalea microptera (Lubber grasshopper) . 

OC Eukaryota ; Metazoa ; Arthropoda ; Hexapoda ; Insecta ; Pterygota ; 



OC Neoptera; Orthopteroidea; Orthoptera; Ensifera; Gryllidae; Gryllinae; 

OC Gryllus. 

OX NCBI_TaxID=6999, 7007; 

RN [1] 

RP SEQUENCE. 

RC SPECIES=G.bimaculatus; TISSUE=Corpora cardiaca; 

RX MEDLINE=8 8 106553; PubMed=3426616; 

RA Gaede G., Rinehart K.L. Jr.; 

RT "Primary sequence analysis by fast atom bombardment mass spectrometry 

RT of a peptide with adipokinetic activity from the corpora cardiaca of 

RT the cricket Gryllus bimaculatus . " ; 

RL Biochem. Biophys . Res. Commun. 149:908-914(1987). 

RN [2] 

RP SEQUENCE. 

RC SPECIES=R.microptera; TISSUE==Corpora cardiaca; 

RX MEDLINE=89145002; PubMed=322694 8 ; 

RA Gaede G. , Hilbich C, Beyreuther K. , Rinehart K.L. Jr.; 

RT "Sequence analyses of two neuropeptides of the AKH/RPCH- family from 

RT the lubber grasshopper, Romalea microptera . " ; 

RL Peptides 9:681-688(1988) . 

CC FUNCTION: THIS HORMONE, RELEASED FROM CELLS IN THE CORPORA 

CC CARDIACA AFTER THE BEGINNING OF FLIGHT, CAUSES RELEASE OF 

CC DIGLYCERIDES FROM THE FAT BODY AND THEN STIMULATES THE FLIGHT 

CC MUSCLES TO USE THESE DIGLYCERIDES AS AN ENERGY SOURCE. 

CC SIMILARITY: BELONGS TO THE AKH / HRTH / RPCH FAMILY. 

DR PIR; A28004; A28004. 

DR InterPro; IPR002047; AKH. 

DR PROSITE; FS00256; AKH; 1. 

KW Neuropeptide; Amidation; Flight; Pyrrolidone carboxylic acid. 

FT MOD_RES 1 1 PYRROLIDONE CARBOXYLIC ACID. 

FT MOD_RES 8 8 7\MIDATI0N. 

SQ SEQUENCE 8 AA; 938 MW; 867861B5B9C452D6 CRC64; 



Query Match 87.2%; Score 41; DB 1; Length 8; 

Best Local Similarity 75.0%; Pred. No. l,3e+05; 

Matches 6; Conservative 2; Mismatches 0; Indels 0; Gaps 

Qy 1 ELNFSTGW 8 

:: I I I I I I 
Db 1 QVNFSTGW 8 



Search completed: October 2, 2003, 08:51:11 
Job time : 28 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 2, 2003, 08:47:39 ; Search time 95 Seconds 

(without alignments) 
21.731 Million cell updates/sec 



Title: 



US-10-072-419-4 



Perfect score: 
Sequence: 



47 

1 ELNFSTGW 8 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 

Database : SPTREMBL 23:* 



1: 


sp_archea : * ■ 


2: 


sp_bacteria : * 


3: 


sp fungi : * 


4: 


sp human : 


5: 


sp invertebrate:* 


6: 


sp_mammal : * 


7: 


sp_mhc: * 


8: 


sp organelle:* 


9: 


sp phage : * 


10: 


sp plant:* 


11: 


sp^rodent : * 


12: 


sp virus:* 


13: 


sp vertebrate:* 


14: 


sp unclassified:* 


15: 


sp rvirus:* 


16: 


sp bacteriap:* 


17: 


sp_ar cheap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


38 


80. 9 


332 


5 


Q18441 


Q18441 caenorhabdi 


2 


38 


80.9 


371 


5 


Q22420 


Q22420 caenorhabdi 


3 


36 


76. 6 


315 


5 


018121 


018121 caenorhabdi 


4 


36 


76.6 


331 


5 


P91217 


P91217 caenorhabdi 


5 


36 


76. 6 


341 


5 


062184 


062184 caenorhabdi 


6 


36 


76. 6 


346 


5 


018042 


018042 caenorhabdi 


7 


36 


76.6 


346 


5 


018041 


018041 caenorhabdi 


8 


36 


76.6 


525 


10 


Q9ZNW2 


Q9znw2 physcomitre 


9 


36 


76.6 


569 


3 


Q9C0V5 


Q9c0v5 schizosacch 


10 


36 


76.6 


663 


12 


Q8 JMll 


Q8jmll mamestra co 


11 


36 


76.6 


666 


12 


Q8QL90 


Q8ql90 mamestra co 


12 


35 


74.5 


124 


16 


Q8XX03 


Q8xx03 ralstonia s 



13 


35 


74 . 


5 


183 


4 


043923 


043923 homo sapien 


14 


35 


74 . 


5 


324 


2 


O07078 


007078 bacillus su 


15 


35 


74 . 


5 


344 


5 


Q18571 


Q18571 caenorhabdi 


16 


35 


74 . 


5 


349 


16 


P72649 


P72649 synechocyst 


17 


35 


74 . 


5 


370 


16 


Q99T29 


Q99t29 staphylococ 


18 


35 


74 . 


5 


370 


16 


Q8NW2 


Q8nvv2 staphylococ 


19 


35 


74 . 


5 


444 


2 


Q9S093 


Q9s093 borrelia bu 


20 


35 


74 . 


5 


927 


5 


002364 


002364 caenorhabdi 


21 


35 


74 . 


5 


1702 


12 


Q8 JXI5 


Q8jxi5 norwalk-lik 


22 


35 


74 . 


5 


1702 


12 


Q8JXI4 


Q8jxi4 norwalk-lik 


23 


34 


72 . 


3 


69 


2 


Q8KNR4 


Q8knr4 bacillus th 


24 


34 


72 . 


3 


324 


3 


Q8TGR8 


Q8t:gr8 saccharomyc 


25 


34 


72 . 


3 


339 


16 


Q9I586 


Q9i58 6 pseudomonas 


26 


34 


72 . 


3 


348 


16 


Q98 JG7 


Q98jg7 rhizobium 1 


27 


34 


72 . 


3 


357 


2 


Q52098 


Q52098 pnotobacter 


28 


34 


72 . 


3 


412 


3 


043029 


O43029 schizosacch 


29 


34 


72 . 


3 


465 


5 


Q21433 


Q21433 caenorhabdi 


30 


34 


72 . 


3 


7 02 


16 


Q8EI46 


Q8ei46 shewanella 


31 


34 


72 • 


3 


750 


16 


Q8XJ85 


Q8xj85 Clostridium 


32 


34 


72 . 


3 


1044 


10 


Q8VY00 


Q8vy00 arabidopsis 


33 


34 


72 . 


3 


1090 


10 


Q9LQK8 


Q91qk8 arabidopsis 


34 


34 


72 , 


3 


1524 


16 


Q8FSZ4 


Q8fsz4 corynebacte 


35 


34 


72 . 


3 


1884 


4 


Q9ULD7 


Q9uld7 homo sapien 


36 


34 


72. 


3 


1885 


4 


Q8IZ J3 


Q8izj3 homo sapien 


37 


33 


70 . 


2 


168 


12 


Q84609 


Q84 609 Paramecium 


38 


33 


70 . 


2 


181 


16 


Q8Y0R3 


Q8y0r3 ralstonia s 


39 


33 


70 . 


2 


227 


17 


Q96YS6 


Q96ys6 sulfolobus 


40 


33 


70 . 


2 


245 


8 


Q8WF52 


Q8wf52 venerupis p 


41 


33 


70 . 


2 


269 


11 


Q8C7R5 


Q8c7r5 mus musculu 


42 


33 


70. 


2 


276 


2 


Q9 JPB2 


Q9jpb2 rhodocyclus 


43 


33 


70 . 


2 


310 


16 


Q8CWB3 


Q8cwb3 escherichia 


44 


33 


70 . 


2 


374 


13 


Q8AXQ0 


Q8axq0 gallus gall 


45 


33 


7 0 . 


2 


377 


16 


Q8CRX7 


Q8crx7 staphylococ 


46 


33 


70 . 


2 


396 


16 


Q8YH14 


Q8yhl4 brucella me 


47 


33 


70 , 


2 


396 


16 


Q8G0U4 


Q8g0u4 brucella su 


48 


33 


70 . 


2 


413 


10 


Q8GWM0 


Q8gwmO arabidopsis 


49 


33 


70 . 


2 


420 


10 


Q9SZC6 


Q9szc6 arabidopsis 


50 


33 


7 0 . 


2 


425 


10 


Q8LCW1 


Q81cwl arabidopsis 


51 


33 


7 0 . 


2 


432 


17 


Q973P1 


Q973pl sulfolobus 


52 


33 


70 . 


2 


438 


10 


Q9SZ17 


Q9szl7 arabidopsis 


53 


33 


70 . 


2 


483 


10 


Q9S768 


Q9s768 oryza sativ 


54 


33 


70. 


2 


484 


10 


Q94HC5 


Q94hc5 oryza sativ 


55 


33 


7 0 . 


2 


538 


2 


Q59143 


Q59143 aeromonas s 


56 


33 


70 . 


2 


550 


3 


Q8 J266 


Q8j266 hebeloma cy 


57 


33 


70 . 


2 


559 


17 


Q9YF45 


Q9yf45 aeropyrum p 


58 


33 


70 . 


2 


652 


5 


Q9NDQ1 


Q9ndql ciona intes 


59 


33 


70. 


2 


682 


16 


Q9PN45 


Q9pn45 campylobact 


60 


33 


70 . 


2 


706 


16 


Q8NMH7 


Q8nmh7 corynebacte 


61 


33 


70. 


2 


749 


8 


Q9TL66 


Q9tl66 helwingia j 


62 


33 


70. 


2 


800 


16 


Q8CX57 


Q8cx57 oceanobacil 


63 


33 


70. 


2 


817 


10 


Q9LLF1 


Q911fl oryza sativ 


64 


33 


70. 


2 


942 


10 


Q8S2C0 


Q8s2cO oryza sativ 


65 


33 


70. 


2 


3121 


13 


042269 


042269 brachydanio 


66 


33 


70. 


2 


3202 


12 


Q8QY01 


Q8qy01 bean common 


67 


33 


70. 


2 


3222 


12 


Q8QY00 


Q8qy0 0 bean common 


68 


33 


70. 


2 


3419 


11 


055147 


055147 rattus norv 


69 


33 


70. 


2 


3429 


11 


008614 


O08614 mus musculu 



70 


32.5 


69 


. 1 


439 


12 


Q9DHQ9 


Q9dhq9 yaba-like d 


71 


32 


68 


■ 1 


82 


16 


Q9PF19 


Q9pfl9 xylella fas 


72 


32 


68 


. 1 


97 


17 


Q9HLC5 


Q9hlc5 thermoplasm 


73 


32 


68 


. 1 


99 


15 


009983 


O09983 human irrunun 


74 


32 


68 


. 1 


100 


17 


Q978U6 


Q97 8u6 thermoplasm 


75 


32 


68 


. 1 


110 


9 


Q94MB9 


Q94mb9 bacteriopha 


76 


32 


68 


. 1 


110 


9 


Q94LZ7 


Q941z7 bacteriopha 


77 


32 


68 


. 1 


110 


9 


Q94MG5 


Q94mg5 bacteriopha 


78 


32 


68 


• 1 


110 


9 


Q94LY9 


Q941y9 bacteriopha 


79 


32 


68 


. 1 


110 


9 


Q94LZ6 


Q941z6 bacteriopha 


80 


32 


68 


. 1 


110 


9 


Q94MG4 


Q94mg4 bacteriopha 


81 


32 


68 


. 1 


110 


9 


Q94MC2 


Q94mc2 bacteriopha 


82 


32 


68 


. 1 


110 


9 


Q94MC1 


Q94mcl bacteriopha 


83 


32 


68 


. 1 


115 


16 


Q9S216 


Q9s216 streptomyce 


84 


32 


68 


. 1 


158 


13 


Q8UWB1 


Q8uwbl xenopus lae 


85 


32 


68 


. 1 


183 


15 


Q9EFC5 


Q9efc5 human immun 


86 


32 


68 


. 1 


196 


6 


Q8MJ3 6 


Q8mj3 6 canis famil 


87 


32 


68 


. 1 


217 


16 


Q8ESH7 


Q8esh7 oceanobacil 


88 


32 


68 


. 1 


236 


11 


070158 


070158 cricetulus 


89 


32 


68 


. 1 


239 


10 


Q8H312 


Q8h312 oryza sativ 


90 


32 


68 


. 1 


241 


16 


Q8CUI8 


Q8cui8 oceanobacil 


91 


32 


68 


. 1 


242 


9 


Q94ML3 


Q94ml3 bacteriopha 


92 


32 


68 


. 1 


277 


4 


Q9Y3P2 


Q9y3p2 homo sapien 


93 


32 


68 


.1 


300 


16 


Q8F4P2 


Q8f4p2 leptospira 


94 


32 


68 


.1 


301 


16 


Q98I00 


Q98i00 rhizobium 1 


95 


32 


68 


. 1 


318 


2 


Q8KLP3 


Q8klp3 streptococc 


96 


32 


68 


. 1 


328 


2 


Q9F7N0 


Q9f7n0 uncultured 


97 


32 


68 


, 1 


328 


15 


Q9EF69 


Q9ef69 human immun 


98 


32 


68 


. 1 


332 


16 


Q8Y1J8 


Q8ylj8 ralstonia s 


99 


32 


68 


.1 


339 


16 


Q9WX36 


Q9wx36 anabaena sp 


100 


32 


68 


.1 


352 


11 


Q8C785 


Q8c7 85 mus musculu 



ALIGNMENTS 



RESULT 1 

Q18441 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 

oc 

OX 
RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 



Q18441 
Q18441; 
Ol-NOV-1996 
01-OCT--2001 
01-MAR-2003 



PRELIMINARY; 



PRT; 



332 AA. 



Created) 

Last sequence update) 
Last annotation update) 



(TrEMBLrel. 01, 
(TrEMBLrel. 18, 
(TrEMBLrel. 23, 
Hypothetical 38.2 kDa protein. 
C34D4. 8 . 

Caenorhabditis elegans. 

Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; 
Rhabditidae; Peloderinae; Caenorhabditis . 

NCBI__TaxID=6239; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=Bristol N2 ; 

MEDLINE=99069613; PubMed=9851916 ; 
None; 

"Genome sequence of the nematode C. elegans: a platform for 
investigating biology. The C. elegans Sequencing Consortium."; 
Science 282:2012-2018(1998). 



RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2 ; 

RA Du Z . , Le T . T . ; 

RT "The sequence of C. elegans cosmid C34D4."; 

RL Submitted (MAY-1996) to the EMBL/GenBank/DDB J databases, 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Br.istol N2 / 

RA Waterston R. ; 

RT "Direct Submission,"; 

RL Submitted {SEP-2001) to the EMBL/GenBank/DDB J databases. 

DR EMBL; U58755; AAB00693.2; -. 

DR WormPep; C34D4.8; CE24826. 

DR InterPro; IPR003002; 7TM_chemol, 

DR InterPro; IPR000168; 7TM_nematode . 

DR Pf am; PF01461 ; 7tm_4 ; 1 . 

KW Hypothetical protein . 

SQ SEQUENCE 332 AA; 38186 MW; 998E39D4DD997D04 CRC64; 



Query Match 80.9%; 
Best Local Similarity 62.5%; 
Matches 5 ; Conservative 



Score 38; DB 5 ; Length 332 
Pred. No. 28; 
2 ; Mismatches 1 ; Indels 



Qy 

Db 



1 ELNFSTGW 8 
I : I : III 
274 EMNYQTGW 281 



Search completed: October 2, 2003, 08:52:55 
Job time : 103 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: October 2, 2003, 08:50:47 



; Search time 26 Seconds 

(without alignments) 

48.681 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



US-10-072-419-4 
47 

1 ELNFSTGW 8 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 587654 seqs, 158212981 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



587654 



Post-processing : 



Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 



Database 



Published__Applications_AA: * 

1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:* 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 

4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:* 

5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 

6 : / cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB . pep : 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB . pep : * 

8: /cgn2__6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: 

10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep 

11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep 

12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: 

13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep 

14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep 

15: /cgn2__6/ptodata/2/pubpaa/USl0C_PUBCOMB,pep 

16: /cgn2_6/ptodata/2/pubpaa/US10_NEW__PUB.pep: 

17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW__PUB.pep: 

18 : /cgn2__6/ptodata/2/pubpaa/US60_PUBCOMB . pep : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB 



ID 



Description 



1 


47 


100 . 


0 


8 


12 


US- 10- 072- 4 19-4 


Sequence 


4, Appli 


2 


47 


100. 


0 


8 


12 


US- 10-072-4 19- 15 


Sequence 


15, Appl 


3 


47 


100. 


0 


8 


12 


US- 10-072-419-25 


Sequence 


25, Appl 


4 


47 


100. 


0 


10 


12 


US- 10-072- 4 19-3 6 


Sequence 


3 6, Appl 


5 


44 


93 . 


6 


8 


12 


US-10- 07 2-41 9-24 


Sequence 


24, Appl 


6 


42 


89 . 


4 


8 


12 


US- 10- 072- 4 19- 11 


Sequence 


11, Appl 


7 


42 


89. 


4 


8 


12 


US- 10- 072-419-30 


Sequence 


30, Appl 


8 


38 


80. 


9 


10 


12 


US-10-072-419-2 


Sequence 


2, Appli 


9 


38 


80. 


9 


463 


15 


US- 10- 15 6- 7 61 -12 632 


Sequence 


12632, A 


10 


37 


78 . 


7 


10 


12 


US-10-072-419-34 


Sequence 


34, Appl 


11 


35 


74 . 


5 


8 


12 


US-10-072-419-5 


Sequence 


5, Appli 


12 


35 


74 . 


5 


8 


12 


US-10-072-419-20 


Sequence 


2 0, Appl 


13 


35 


74 . 


5 


8 


12 


US- 10- 072- 4 19-21 


Sequence 


21, Appl 


14 


35 


74 . 


5 


8 


12 


" US-10-072-419-26 


Sequence 


2 6, Appl 


15 


35 


74 . 


5 


183 


9 


US-09-801-196-33 


Sequence : 


33, Appl 


16 


34 


72 . 


3 


41 


9 


US~0 9- 864-761- 378 64 


Sequence : 


37864, A 


17 


33 


70 . 


2 


706 


10 


US- 09- 7 3 8- 62 6- 6357 


Sequence 


6357, Ap 


18 


32 


68 . 


1 


8 


12 


US-10-072-419-9 


Sequence 


9, Appli 


19 


32 


68 . 


1 


8 


12 


US-10-072-419-12 


Sequence 


12, Appl 


20 


32 


68 . 


1 


8 


12 


US- 10- 072- 4 19- 14 


Sequence 


14, Appl 


21 


32 


68 . 


1 


8 


12 


US~1 0-072-4 19-17 


Sequence 


17, Appl 


22 


32 


68 . 


1 


8 


12 


US- 10-072-4 19- 18 


Sequence 


18, Appl 


23 


32 


68 . 


1 


10 


12 


US-10-072-419-1 


Sequence 


1, Appli 


24 


32 


68 . 


1 


10 


12 


US-10-072-419-32 


Sequence 


32, Appl 


25 


32 


68 . 


1 


10 


12 


US- 10- 07 2- 4 19- 35 


Sequence 


35, Appl 


26 


32 


68 . 


1 


10 


12 


US- 10- 07 2- 4 19- 4 0 


Sequence 


4 0, Appl 


27 


32 


68 . 


1 


72 


11 


US- 09- 8 9 9- 4 9 5- 7 8 


Sequence 


7 8 , Appl 


28 


32 


68 . 


1 


91 


10 


US-09-843-67 6-220 


Sequence 


220, App 


29 


32 


68 . 


1 


91 


11 


US-09-4 38-4 8 6-220 


Sequence 


220, App 


30 


32 


68 . 


1 


91 


15 


US- 10- 053-758-220 


Sequence 


22 0, App 


31 


32 


68 . 


1 


91 


15 


US- 10- 05 4-2 9 5-2 20 


^ 

Sequence 


22 0, App 


32 


32 


68 . 


1 


91 


15 


US- 10- 05 4 -611-2 20 


Sequence 


22 0, App 


33 


32 


68 . 


1 


115 


15 


US- 10- 15 6- 7 61- 14 021 


Sequence 


14021, A 


34 


32 


68 . 


1 


128 


10 


US-09-840-459-86 


Sequence 


8 6, Appl 


35 


32 


68 . 


1 


208 


10 


US- 09-7 64-8 68- 652 


Sequence 


652, App 


36 


32 


68 . 


1 


208 


10 


US- 09-7 64- 8 68-1 07 8 


Sequence 


1 r\'~) d 7\ „ 

1078, Ap 


37 


32 


68 . 


1 


610 


9 


US-09-74 8-1 07-2 


Sequence i 


2, Appli 


38 


32 


68 . 


1 


610 


12 


US-10-28 1-34 6-2 


Sequence 


2, Appli 


39 


32 


68 . 


1 


660 


12 


US- 10-2 3 8- 07 5-7 32 


Sequence 


732, App 


40 


32 


68 . 


1 


757 


14 


US- 10- 117-84 6- 10 


Sequence 


10, Appl 


41 


32 


68 . 


1 


758 


14 


US- 10-11 7- 84 6-1 6 


Sequence 


16, Appl 


42 


32 


68 . 


1 


815 


9 


US-0 9- 8 15-2 4 2 -5 106 


Sequence . 


510 6, Ap 


43 


32 


68 . 


1 


1532 


10 


US- 09-7 38- 62 6- 4 321 


Sequence 


4321, Ap 


44 


31 


66. 


0 


8 


12 


US-10-072-419-13 


Sequence 


13, Appl 


45 


31 


66. 


0 


56 


12 


US- 10-1 95-7 30- 19 8 


Sequence 


198, App 


46 


31 


66. 


0 


68 


10 


US- 09-7 64-855-93 


Sequence 


93, Appl 


47 


31 


66 . 


0 


68 


15 


US-10-072~34 9-93 


Sequence 


93, Appl 


48 


31 


66. 


0 


72 


15 


US-10-156-761-8551 


Sequence 


8551, Ap 


49 


31 


66. 


0 


265 


9 


US-09~742-693-30 


Sequence 


30, Appl 


50 


31 


66. 


0 


274 


12 


US-10-023-617-5 


Sequence 


5, Appli 


51 


31 


66. 


0 


274 


16 


US-10-176-306-67 . 


Sequence 


67, Appl 


52 


31 


66. 


0 


433 


16 


US-10-080-170-547 


Sequence 


547, App 


53 


31 


66. 


0 


454 


15 


US-10-156-761-11138 


Sequence 


11138, A 


54 


31 


66. 


0 


486 


9 


US-09-925-301-1326 


Sequence 


1326, Ap 


55 


31 


66. 


0 


494 


9 


US-09-815-242-11055 


Sequence 


11055, A 


56 


31 


66. 


0 


866 


16 


US-10-222-038-2 


Sequence 


2, Appli 



57 


31 


66 . 


0 


1352 


11 


US- 09-784-554 B- 2 


Sequence 


2, Appli 


58 


31 


66. 


0 


2710 


15 


US- 10- 01 1-3 6 6- 6 


Sequence 


6, Appli 


59 


31 


66. 


0 


2862 


9 


US- 09- 7 42 -65 9-5 


Sequence 


5,r Appli 


60 


31 


66. 


0 


2864 


8 


US- 08- 4 24 -55 OB- 3 94 


Sequence 


394, App 


61 


31 


66. 


0 


2864 


9 


us- 09-7 42- 659-2 


Sequence 


2f Appli 


62 


31 


66 . 


0 


2865 


9 


US -09-742- 65 9- 6 


Sequence 


6, Appli 


63 


30 


63. 


8 


8 


12 


us- 10- 07 2-4 19-10 


Sequence 


10, Appl 


64 


30 


63. 


8 


56 


9 


us- 09- 8 64-7 61-4 4 972 


Sequence 


44972, A 


65 


30 


63. 


8 


67 


10 


us- 09- 7 64- 8 7 7- 12 93 


Sequence 


12 93, Ap 


66 


30 


63 . 


8 


76 


15 


us- 10-07 4 -4 7 5-2 7 7 


Sequence 


277, App 


67 


30 


63 , 


8 


78 


9 


US- 0 9- 8 64-7 51-4 6012 


Sequence 


4dU12 , A 


68 


30 


63 . 


8 


133 


12 


US- 10- 08 4- 8 4 3-2 8 8 


Sequence 


2 88, App 


69 


30 


63 . 


8 


133 


12 


US-10- 193-002-2 83 


Sequence 


2 8 3, App 


70 


30 


63 . 


8 


157 


15 


US- 10-22 7 -62 9-2 0 


Sequence 


20, Appl 


71 


30 


63 . 


8 


193 


12 


US- 10-2 3 0-3 3 1-2 3 


Sequence 


23, Appl 


72 


30 


63 . 


8 


214 


12 




Sequence 


/ o u y , Ap 


73 


30 


63 . 


8 


216 


10 


US-09-73 8-62 6-57 4 8 


Sequence 


57 4 8, Ap 


74 


30 


63 . 


8 


249 


10 


US-09-712-363-172 


Sequence 


172, App 


75 


30 


63 . 


8 


252 


16 


US- 10-27 8 -53 6- 190 


Sequence 


190, App 


76 


30 


63 . 


8 


337 


12 


US- 10-2 3 8- 07 5- 113 5 


Sequence 


1135, Ap 


77 


30 


63 . 


8 


381 


15 


US-10-15 6-7 61-8102 


Sequence 


8102, Ap 


78 


30 


63 . 


8 


4 4 0 


16 


US-10-080-17 0-164 


^ _ 

Sequence 


164, App 


79 


30 


63 . 


8 


603 


10 


US-09- 97 3-457-2 


Sequence 


2, Appli 


80 


30 


63 . 


8 


630 


9 


US- 09- 84 3-5 98 -10 


Sequence 


10, Appl 


81 


30 


63 . 


8 


684 


12 


US -10-032-58 5-7759 


Sequence 


7759, Ap 


82 


30 


63 , 


8 


714 


12 


US-10-032-58 5-7 532 


Sequence 


7532, Ap 


83 


30 


63 . 


8 


807 


12 


US-10- 096-57 8-3 


Sequence 


3, Appli 


84 


30 


63, 


8 


1350 


11 


us- 09-7 8 4 -5 5 4B- 4 


Sequence 


4 , App 1 i 


85 


30 


63. 


8 


1456 


10 


US- 09- 8 7 0-759-95 


Sequence 


95, Appl 


86 


30 


63. 


8 


1456 


12 


US-09-751-708A-95 


Sequence 


95, Appl 


87 


30 


63. 


8 


1881 


11 


US-09- 998-42 5-3 


Sequence 


3, Appli 


88 


30 


63. 


8 


1881 


11 


us- 09- 9 97 -97 7- 3 


Sequence 


3, Appli 


89 


30 


63. 


8 


4590 


15 


us- 10- 16 0-7 5 8- 13 


Sequence 


13, Appl 


90 


30 


63. 


8 


4590 


15 


us -10-16 0-758-14 


Sequence 


14, Appl 


91 


30 


63. 


8 


4590 


15 


US -10- 060- 03 6- 157 


Sequence 


157, App 


92 


29 


61.7 


Q 
O 




Ub— lU-U /z-4iy-lD 


Sequence 


lo, AppX 


93 


29 


61. 


7 


8 


12 


US-10-072-419-23 


Sequence 


23, Appl 


94 


29 


61. 


7 


18 


9 


US-09-864-761~47870 


Sequence 


47870, A 


95 


29 


61. 


7 


72 


9 


US-09-864-761-45981 


Sequence 


45981, A 


96 


29 


61. 


7 


82 


11 


US-09-764-891-3090 


Sequence 


3090, Ap 


97 


29 


61. 


7 


82 


15 


US-10-205-428-275 


Sequence 


275, App 


98 


29 


61. 


7 


92 


11 


US-09-764-891-3603 


Sequence 


3603, Ap 


99 


29 


61. 


7 


94 


15 


US-10-214-932-44 


Sequence 


44, Appl 


100 


29 


61. 


7 


118 


11 


US-09-530-139-22 


Sequence 


22, Appl 



ALIGNMENTS 



RESULT 1 
US-10-072-419-4 

; Sequence 4, Application US/10072419 

; Publication No. US20030162717A1 

; GENERAL INFORMATION: 

; APPLICANT: Schacter, Bernice 

; APPLICANT: Schacter, Lee 



; TITLE OF INVENTION: Compositions and Methods for Promoting Lipid Mobilization 
in Humans 

; FILE REFERENCE: 10739-1 

; CURRENT APPLICATION NUMBER: US/10/072,419 

; CURRENT FILING DATE: 2002-02-07 

; NUMBER OF SEQ ID NOS : 42 

; SOFTWARE: Patentin version 3.0 

; SEQ ID NO 4 

; LENGTH: 8 

; TYPE: PRT 

; ORGANISM: Schistocerca gregaria 
US-10-072-419-4 

Query Match 100.0%; Score 47; DB 12; Length 8; 

Best Local Similarity 100.0%; Pred. No. 5.3e+05; 

Matches 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 ELNFSTGW 8 

I I I I I I I I 
Db 1 ELNFSTGW 8 



RESULT 15 
US-09-801-196-33 

; Sequence 33, Application US/09801196 

; Patent No. US20020037827A1 

; GENERAL INFORMATION: 

; APPLICANT: Wang, Kai 

; APPLICANT: Smith, Ryan 

; APPLICANT: Fajardo, Mark 

; APPLICTVNT: Moss, Patrick 

; TITLE OF INVENTION: A NOVEL MATRIX METALLOPROTEINASE (MMP-25) 

; TITLE OF INVENTION: EXPRESSED IN SKIN CELLS 

; FILE REFERENCE: 240083.509 

; CURRENT APPLICATION NUMBER: US/09/801,196 

; CURRENT FILING DATE: 2001-03-06 

; NUMBER OF SEQ ID NOS: 37 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 33 
; LENGTH: 183 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-801-196-33 



Query Match 74.5%; Score 35; DB 9; Length 183; 

Best Local Similarity 75.0%; Pred. No. 70; 

Matches 6; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 1 ELNFSTGW 8 

II II II 
Db 162 ELGFSRGW 169 



RESULT 27 
US-09-899-495-78 



; Sequence 78, Application US/09899495 

; Publication No. US20030088060A1 

; GENERAL INFORMATION: 

; APPLICANT: Benjamin;, Christopher W. 

; APPLICANT: Roberts, Steven L. 

; APPLICANT: Karnovsky, Alia M. 

; APPLICANT: Ruble, Cara L. 

; TITLE OF INVENTION: Human Ion Channels 

; FILE REFERENCE: 00188US1 

; CURRENT APPLICATION NUMBER: US/09/899, 495 

; CURRENT FILING DATE: 2001-07-05 

; PRIOR APPLICATION NUMBER: 60/215,815 

; PRIOR FILING DATE: 2000-07-05 

; PRIOR APPLICATION NUMBER: 60/216,481 

; PRIOR FILING DATE: 2000-07-06 

; PRIOR APPLICATION NUMBER: 60/216,479 

; PRIOR FILING DATE: 2000-07-06 

; PRIOR APPLICATION NUMBER: 60/216,482 

; PRIOR FILING DATE: 2000-07-06 

; PRIOR APPLICATION NUMBER: 60/217,096 

; PRIOR FILING DATE: 2000-07-10 

; NUMBER OF SEQ ID NOS : 125 

; SOFTWARE: Patentin version 3.0 

; SEQ ID NO 78 
LENGTH: 72 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-899-495-78 

Query Match 68.1%; Score 32; DB 11; Length 72; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 4 FSTGW 8 

Mill 

Db 18 FSTGW 22 



Search completed: October 2, 2003, 08:54:45 
Job time : 27 sees 



GenCore version 5 . 1 . 4_p5_457 8 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



April 11, 2003, . 18: 59: 09 ; Search time 29 Seconds 

(without alignments) 
56.841 Million cell updates/sec 

PCT-US03"03800~2 0 
49 

1 ELNFSPNW 8 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

671580 seqs, 206047115 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 
Maximum DB seq length: 25 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



9297 



Database : SPTREMBL_21 : * 

1: sp_archea:* 

2: sp_bacteria : * 

3: sp_fungi:* 

4 : sp__human : * 

5: sp^invertebrate : * 

6 : sp_maramal : * 

7: sp_mhc:* 

8: sp__organelle : * 

9: sp^phage:* 

10: sp_plant:* 

11: sp_rodent:* 

12: sp_virus:* 

13: sp_vertebrate : * 

14: sp_unclassif ied: * 

15: sp__rvirus : * 

1 6 : sp_bacteriap : * 

17 : sp_archeap : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB 



ID 



Description 



1 


26 


53 . 


1 


15 


6 


Q9TRA6 


Q9t:ra6 bos taurus 


2 


24 


49 . 


0 


19 


6 


Q9N1W2 


Q9nlw2 equus cabal 


3 


24 


49 . 


0 


24 


2 


Q56150 


yDbiou s treptomyce 


4 


24 


49 . 


0 


24 


6 


Q9N1W1 


Q9nlwl equus cabal 


5 


24 


49 . 


0 


25 


3 


Q96VQ6 


Q96vq6 neurospora 


6 


23 


46 , 


9 


16 


12 


Q9YQ11 


Q9yqll transmissib 


7 


23 


46. 


9 


18 


8 


Q9ZYW4 


Q92yw4 habrobracon 


8 


23 


46 . 


9 


23 


2 


Q446o2 


y44bo^: Dacilius am 


9 


23 


46. 


9 


25 


2 


087961 


Oo7961 nelicobacte 


10 


22 


44 . 


9 


19 


8 


Q9ZYW3 


Q9zyw3 gnamptodon 


11 


22 


44 . 


9 


20 


4 


Q92855 


Q^ZoDD nortio sapxen 


12 


22 


44 . 


9 


22 


13 


Q9PRQ6 


Q9prq6 morone saxa 


13 


22 


44 . 


9 


23 


4 


Q16233 


Qlb^oo nomo sapien 


14 


22 


44 . 


9 


23 


11 


Q9CV68 


Q9cv68 mus musculu 


15 


21 


42 . 


9 


13 


8 


Q9MQK0 


Q9mqk0 cervus elap 


16 


21 


42 . 


9 


16 


4 


Q9UCG5 


Q9ucg5 homo sapien 


17 


21 


42 . 


9 


16 


4 


Q9UCA6 


Q9uca6 homo sapien 


18 


21 


42 . 


9 


18 


2 


Q9R4V9 


Q9r4v9 campylobact 


19 


21 


42 . 


9 


19 


8 


Q9ZYW8 


Q9zyw8 ichneutes b 


20 


21 


42 . 


9 


20 


6 


P82666 


P82666 bos taurus 


21 


21 


42 . 


9 


22 


4 


Q9UNS8 


Q9uns8 homo sapien 


22 


21 


42 . 


9 


22 


4 


Q96Q24 


Q96q24 homo sapien 


23 


21 


42 . 


9 


22 


15 


Q75446 


Q75446 human immun 


24 


21 


42 . 


9 


23 


11 


Q91ZE2 


Q91ze2 rattus norv 


25 


21 


42 . 


9 


23 


1 1 
11 


Q91V21 


yyiv^iX rattus noxv 


26 


21 


42 . 


9 


23 


16 


026030 


O^ibUoU neXicoDacte 


27 


21 


42 , 


9 


24 


3 


Q9Uk8o 


Q9ur88 aspergillus 


28 


20 


40 , 


8 


10 


8 


Q8SHB1 


Q8shbl rhampholeon 


29 


20 


40 . 


8 


10 


8 


Q8SH99 


Q8sh99 brookesia n 


30 


20 


40. 


8 


11 


8 


Q9G649 


Q9g649 otocryptis 


31 


20 


40 . 


8 


14 


2 


Q45876 


Q4DO /b cxostriaium 


32 


20 


40 . 


8 


14 


2 


Q45872 


Q4oo fZ cXostrxaium 


33 


20 


40 . 


8 


16 


2 


Q9R4 F2 


Q9r4f2 streptomyce 


34 


20 


40 . 


8 


17 


3 


Q9URC6 


Q9urc6 saccharomyc 


35 


20 


40 . 


8 


18 


11 


Q9QW51 


Q9qw51 mus sp. act 


36 


20 


40 . 


8 


19 


2 


^ ft CI XT O 

QyRoNo 


Q9r5n8 Clostridium 


37 


20 


4 U . 


Q 

o 




lU 


Iro o^: Ub 


P 8 32 06 gynostemma 


38 


20 


40. 


8 


20 


2 


Q9R4D9 


Q9r4d9 escherichia 


39 


20 


40. 


8 


20 


6 


Q9TTG3 


Q9ttg3 ateles belz 


40 


20 


40. 


8 


22 


2 


Q52009 


Q52009 pseudomonas 


41 


20 


40. 


8 


23 


11 


P70635 


P70635 rattus norv 


42 


20 


40. 


8 


25 


2 


Q9R685 


Q9r685 bacillus sp 


43 


20 


40. 


8 


25 


5 


Q9NDS5 


Q9nds5 drosophila 


44 


20 


40. 


8 


25 


12 


Q9WMG7 


Q9wmg7 sigma virus 


45 


19.5 


39. 


8 


18 


2 


030588 


030588 streptomyce 



ALIGNMENTS 



RESULT 1 
Q9TRA6 



ID 
AC 
DT 
DT 
DT 



Q9TRA6 

Q9TRA6; 

Ol-MAY-2000 

Ol-MAY-2000 

Ol-JUN-2002 



PRELIMINARY; 

(TrEMBLrel. 13, 
(TrEMBLrel. 13, 
(TrEMBLrel. 21, 



PRT; 



15 AA. 



Created) 

Last sequence update) 
Last annotation update) 



DE PA7 00 subunit P31=ATP- dependent 20 S proteasome activator 

DE (Fragment) , 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=94342244; PubMed=8063704 ; 

RA DeMartino G.N., Moomaw C.R., Zagnitko O.P.^ Proske R.J., Chu-Ping M. , 

RA Afendis S.J., Swaf field J.C., Slaughter C.A. ; 

RT "PA7 00, an ATP-dependent activator of the 2 0 S proteasome, is an 

RT ATPase containing multiple members of a nucleotide-binding protein 

RT family."; 

RL J. Biol. Chem. 269:20878-20884(1994). 

FT NON__TER 1 1 

FT NON_TER 15 15 

SQ SEQUENCE 15 AA; 1659 MW; D189812E9389B755 CRC64; 



Query Match 53. 1%; 

Best Local Similarity 83.3%; 
Matches 5; Conservative 



Score 26; DB 6; Length 15; 
Pred. No. 1.9e+02; 
0; Mismatches 1; Indels 



0; Gaps 



Qy 1 ELNFSP 6 

I I I I I 
Db 5 ELNFLP 10 



Search completed: April 11, 2003, 19:01:03 
Job time : 32 sees 



GenCore version 5 , 1 , 4_p5_457 8 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



April 11, 2003, 18:58:50 ; Search time 11 Seconds 

(without alignments) 
30.165 Million cell updates/sec 

PCT-US03-03800-20 
49 

1 ELNFSPNW 8 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 
112892 seqs, 41476328 residues 



Total number of hits satisfying chosen parameters: 



1520 



Minimum DB seq length: 0 
Maximum DB seq length: 25 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_40 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


46 


93. 


9 


8 


1 


HTF TENMO 


P25419 


tenebrio mo 


2 


43 


87. 


8 


8 


1 


HTFl PERAM 


P04548 


periplaneta 


3 


40 


81. 


6 


8 


1 


RPCH_PANBO 


P08939 


pandalus bo 


4 


40 


81. 


6 


10 


1 


HTFl ROMMI 


P18110 


romalea mic 


5 


38 


77. 


6 


8 


1 


AKH MELML 


P25423 


melolontha 


6 


37 


75. 


5 


8 


1 


HTF2 PERAM 


P04549 


periplaneta 


7 


37 


75. 


5 


10 


1 


HTF2_CARM0 


P11385 


carausius m 


8 


37 


75. 


5 


10 


1 


HTF_NAUCI 


P10939 


nauphoeta c 


9 


35 


71. 


4 


8 


1 


AKH_LIBAU 


P25418 


libellula a 


10 


31 


63. 


3 


8 


1 


AKH TABAT 


P14595 


tabanus atr 


11 


31 


63. 


3 


10 


1 


HTF__TABAT 


P14596 


tabanus atr 


12 


29 


59. 


2 


8 


1 


AKHG GRYBI 


P14086 


gryllus bim 


13 


26 


53. 


1 


10 


1 


HTF_HELZE 


P16353 


heliothis z 


14 


25 


51. 


0 


18 


1 


A2M OCTVU 


P30800 


octopus vul 


15 ' 


23 


46. 


9 


10 


1 


AKHX_LOCMI 


P81626 


locusta mig 


16 


21 


42. 


9 


21 


1 


CFPA TREPH 


P56738 


treponema p 


17 


20 


40. 


8 


13 


1 


YPNP PHOLU 


P41122 


photorhabdu 



18 


19 


38 


. 8 


6 


1 


EIOI LITRU 


Ti o o ri 


litoria rub 


19 


19 


38 


. 8 


11 


1 


R?\NC_RANPI 


Ti r\ o Ci a '] 

PUo y oi 


rana pipien 


20 


19 


38 


. 8 


18 


1 


NPA BOVIN 


n 1 R c: A 

PlooUo 


bos taurus 


21 


19 


38 


. 8 


23 


1 


FMK7_PSEAE 


Q53391 


p s e udomo n a s 


22 


19 


38 


. 8 


25 


1 


ATPO SPIOL 


P8u082 


spinacia ol 


23 


18 


36 


. 7 


10 


1 


BRK ONCMY 


yypr z 1 


oncorhynchu 


24 


18 


36 


. 7 


13 


1 


BOML PSEGU 


P4«i y y 1 


pseudophryn 


25 


18 


36 


. 7 


19 


1 


LPRM STAA.U 


r U o U bo 


s taphylococ 


26 


18 


36 


. 7 


20 


1 


CRTC_SPIOL 


P30806 


spinacia ol 


21 


17 


34 


. 7 


10 


1 


BPP2 BGTIN 




bothrops in 


28 


17 


34 


. 7 


10 


1 


BPP2 BOTJA 


P U 1 U Z Z 


bothrops ja 


29 


17 


34 


, 1 


1 0 


1 




D 1 Q 1 1 Q 

f 1 y 1 1 o 


nicotiana p 


30 


17 


34 


. 7 


12 


1 


LMTl LOCMI 


T-| O O O C 

P22395 


locusta mig 


31 


17 


34 


. 7 


13 


1 


BRK_PARI D 


P42717 


parapolybia 


32 


17 


34 


. 7 


13 


1 


LMT4 LOCMI 


T5 yi "1 /ion 


locusta mig 


33 


17 


34 


. 7 


14 


1 


MAST PARID 


P4 z /lb 


parapolybia 


34 


17 


34 


. 7 


14 


1 


MAST VESXA 


PU lolo 


vespa xanth 


35 


17 


34 


. 7 


15 


1 


GLN2 PINPS 


Polio / 


pinus pinas 


36 


17 


34 


. 7 


16 


1 


VPR HV1S3 


P19555 


hiirnan immun 


37 


17 




• / 


1 / 


1 




DPI ^ ^ ^ 


pinus pinas 


38 


17 


34 


.7 


17 


1 


VESP_VESMC 


P57672 


vespula mac 


39 


17 


34 


. 7 


19 


1 


BRKM BOMMX 


P83055 


bombina max 


40 


17 


34 


.7 


19 


1 


C0X4_0NCMY 


P80327 


oncorhynchu 


41 


17 


34 


.7 


19 


1 


CXR CONTU 


P58811 


conus tulip 


42 


17 


34 


.7 


19 


1 


RS19_SPICI 


031159 


spiroplasma 


43 


17 


34 


.7 


20 


1 


HGLl FASHE 


P80527 


fasciola he 


44 


17 


34 


.7 


22 


1 


HGL2 FASHE 


P80530 


fasciola he 


45 


17 


34 


.7 


23 


1 


lAPP LEPEU 


Q07333 


lepus europ 



ALIGNMENTS 



RESULT 1 
HTF_TENMO 

ID HTF__TENMO STANDARD; PRT; 8 AA. 

AC P25419; 

DT Ol-MAY-1992 (Rel. 22, Created) 

DT Ol-FEB-1994 (Rel. 28, Last sequence update) 

DT 01-FEB^1994 (Rel. 28, Last annotation update) 

DE Hypertrehalosaemic factor (HOTH) (Hypertrehalosemic neuropeptide) . 

OS Tenebrio molitor (Yellow mealworm) , and 

OS Zophobas rugipes. 

OC Eukaryota; Metazoa; Arthropoda; Mandibulata; Pancrustacea; Hexapoda; 

OC Insecta; Pterygota; Neoptera; Endopterygota; Coleoptera; Polyphaga; 

OC Cucujif ormia; Tenebrionidae; Tenebrio. 

OX NCBI_TaxID=7067, 7075; 

RN [1] 

RP SEQUENCE. 

RC TISSUE=Corpora cardiaca; 

RX MEDLINE=90341081; PubMed=238 1871 ; 

RA Gaede G. , Rosinski G. ; 

RT "The primary structure of the hypertrehalosemic neuropeptide from 

RT tenebrionid beetles: a novel member of the AKH/RPCH family,"; 

RL Peptides 11:455-459(1990). 

CC FUNCTION: HYPERTREHALOSAEMIC FACTORS ARE NEUROPEPTIDES THAT 

CC ELEVATE THE LEVEL OF TREHALOSE IN THE HEMO LYMPH (TREHALOSE IS 



cc THE MAJOR CARBOHYDRATE IN THE HEMOLYMPH OF INSECTS). 

CC SIMILARITY: BELONGS TO THE AKH / HRTH / RPCH FAMILY. 

DR PIR; A43976; A43976. 

DR PIR; B43976; B43976. 

DR InterPro; IPR002047; AKH. 

DR PROSITE; PS00256; AKH; 1. 

KW Neuropeptide; Amidation. 

FT MOD_RES 1 1 PYRROLIDONE CARBOXYLIC ACID. 

FT MOD__RES 8 8 AMIDATION. 

SQ SEQUENCE 8 AA; 1005 MW; 86745775B9C44736 CRC64/ 



Query Match 93,9%; 
Best Local Similarity 87.5%; 
Matches 7; Conservative 



Score 46; DB 1 ; Length 8; 
Pred. No. l.le+05; 
1; Mismatches 0; Indels 



Qy 1 ELNFSPNW 8 

: I I I I- IJ I 
Db 1 QLNFSPNW 8 



Search completed: April 11, 2003, 19:00:26 
Job time : 12 sees 



GenCore version 5 . 1 . 4__p5_4578 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: April 11, 2003, 18:59:35 ; Search time 15 Seconds 

(without alignments) 
51.272 Million cell updates/sec 

Title: PCT-US03-03800-20 
Perfect score: 49 
Sequence: 1 ELNFSPNW 8 



Scoring table; BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283224 seqs, 96134422 residues 

Total number of hits satisfying chosen parameters: 4984 

Minimum DB seq length: 0 
Maximum DB seq length: 25 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



PIR_73:* 
pirl:* 
pir2:* 

pir3:* 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



i 


46 


93. 9 


8 


2 


A43976 


hyper trehalosemic 


2 


46 


93. 9 


8 


2 


B43976 


hypertrehalosemic 


3 


46 


93.9 


8 


2 


A05169 


neuropeptide M-I - 


4 


43 


87.8 


8 


2 


S08995 


hypertrehalosemic 


5 


43 


87 . 8 


8 


2 


A49823 


adipokinetic hormo 


6 


43 


87, 8 


8 


2 


A44960 


neuropeptide Led-C 


7 


41 


83.7 


8 


2 


S21663 


neuropeptide - flo 


8 


40 


81. 6 


8 


2 


A61348 


red pigment- concen 


9 


38 


77.6 


8 


2 


S15422 


adipokinetic hormo 


10 


38 


77.6 


8 


2 


A58641 


adipokinetic hormo 


11 


38 


77. 6 


10 


2 


S53789 


neuropeptide Pec-H 


12 


37 


75.5 


8 


2 


S08996 


hypertrehalosemic 


13 


37 


75.5 


8 


2 


B49823 


adipokinetic hormo 



14 


. 37 


75 . 5 


8 


2 


B44960 


neuropeptide Led-C 


15 


37 


75.5 


10 


2 


A60421 


hyper trehal OS emic 


16 


37 


75.5 


10 


2 


S08997 


hypertrehalosemic 


17 


37 


75.5 


10 


2 


S08998 


hypertrehalosemic 


18 


37 


75.5 


10 


2 


A26381 


hypertrehalosemic 


19 


37 


75 . 5 


10 


2 


JC14 16 


hypertrehalosemic 


20 


37 


75.5 


10 


2 


S09138 


hypertrehalosemic 


21 


35 


71.4 


8 


2 


S10596 


adipokinetic hormo 


22 


35 


71.4 


8 


2 


S11545 


adipokinetic hormo 


23 


34 


69. 4 


8 


2 


S55310 


adipokinetic hormo 


24 


34 


69 . 4 


8 


2 


A58 62 0 


adipokinetic hormo 


25 


31 


63 . 3 


8 


2 


A33995 


adipokinetic hormo 


26 


31 


63 . 3 


10 


2 


B33995 


hypotrehalosemic h 


27 


29 


59 . 2 


8 


2 


A28004 


adipokinetic hormo 


28 


26 


53 . 1 


10 


2 


A31571 


hypertrehalosemic/ 


29 


25 


51 . 0 


17 


2 


B31435 


adherence lectin 1 


30 ' 


25 


51 . 0 


18 


2 


S23971 


alpha-macroglobuli 


31 


25 


51 . 0 


24 


2 


A3 9 5 0 9 


mannose-speci f ic 1 


32 


24 


49.0 


9 


2 


A24244 


adipokinetic hormo 


33 


24 


49. 0 


18 


2 


PT0286 


Ig heavy chain CDR 


34 


23 


46. 9 


22 


2 


PQ0070 


T-cell receptor be 


35 


23 


46.9 


23 


2 


T50545 


reductase [importe 


36 


22 


44.9 


22 


2 


D32537 


T-cell receptor al 


37 


22 


44 . 9 


24 


2 


T46622 


hypothetical prote 


38 


21 


42.9 


8 


2 


A14683 


aspartate transami 


39 


21 


42.9 


23 


2 


D64707 


hypothetical prote 


40 


20 


40.8 


7 


4 


A58725 


virotoxin - destro 


41 


20 


40.8 


14 


2 


S58B62 


botulinum neurotox 


42 


20 


40.8 


14 


2 


S58866 


botulinum neurotox 


43 


20 


40.8 


16 


2 


A24099 


crystal protein, 2 


44 


20 


40.8 


16 


2 


S51610 


hypothetical prote 


45 


20 


40.8 


20 


2 


S67990 


neurotoxin-associa 



ALIGNMENTS 



RESULT 1 
A43976 

hypertrehalosemic hormone - yellow mealworm 
C; Species: Tenebrio molitor (yellow mealworm) 

C;Date: 03-Feb-1993 #sequence_revision 03-Feb-1993 #text_change 07-May-1999 
C;Accession: A43976 
R;Gaede, G. ; Rosinski, G. 
Peptides 11, 455-459, 1990 

A; Title: The primary structure of the hypertrehalosemic neuropeptide from 

tenebrionid beetles: a novel member of the AKH/RPCH family. 

A;Reference number: A43976; MUID: 90341081; PMID:2381871 

A;Accession: A43976 

A;Molecule type: protein 

A; Residues: 1-8 <GAE> 

C; Super family : adipokinetic hormone 

C; Keywords: amidated carboxyl end; corpora cardiaca; hormone; neuropeptide; 
pyroglutamic acid 

F;l/Modified site: pyrrolidone carboxylic acid (Gin) #status experimental 
F;8/Modified site: amidated carboxyl end (Trp) #status experimental 



Query Match 93.9%; Score 46; DB 2; Length 8; 

Best Local Similarity 87.5%; Pred. No. 2.8e+05; 

Matches 7; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ELNFSPNW 8 

: i I I I I I I 
Db 1 QLNFSPNW 8 



Search completed: April 11, 2003, 19:01:23 
Job time : 16 sees 



GenCore version 5 . 1 . 4_p5__457 8 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



April 11, 2003, 19:00:30 ; Search time 14 Seconds 

(without alignments) 
34.935 Million cell updates/sec 

PCT-US03-03800-20 
49 

1 ELNFSPNW 8 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 248812 seqs, 61136040 residues 

Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 
Maximum DB seq length: 25 



54760 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



Published 
/ cgn2 
/cgn2 
/ cgn2 
/ cgn2_ 
/cgn2_ 
/ cgn2_ 
/ cgn2__ 
/ cgn2_ 
/cgn2_ 
/ cgn2 
/cgn2 
/cgn2 
/cgn2 
/ cgn2 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 



Applications_AA: * 
6/ptodata/2 /pubpaa 
6/ptodata/2/pubpaa 
6 /ptodat a/2 /pubpaa 
6/ptodat a/2 /pubpaa 
6/ptodata/2 /pubpaa 
6/ptodata/2 /pubpaa 
6/ptodata/2 /pubpaa 
6/ptodata/2 /pubpaa 
6/ptodata/2/pubpaa 
_6/ptodata/2/pubpa 
6/ ptodata/ 2 / pubpa 
6/ptodat a/ 2 /pubpa 
6/ptodata/2/pubpa 
6/ptodat a/ 2 /pubpa 



/US 0 8_NEW_PUB . pep : * 
/PCT_NEW_PUB.pep: * 
/US06_NEW_PUB.pep: * 
/US06_PUBCOMB.pep: * 
/US07_NEW_PUB.pep: * 
/US07_PUBCOMB.pep: * 
/PCTUS_PUBCOMB . pep : * 
/US 0 8__PUBC0MB . pep : * 
/US09_NEW_PUB.pep: * 
a/US0 9_PUBCOMB.pep: * 
a/US 10_NEW_PUB . pep : * 
a/US 10_PUBCOMB . pep : * 
a/US 60_NEW_PUB . pep : * 
a/US 60 PUBCOMB.pep:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 






Description 


1 


31 


63.3 


10 


10 


US- 


09- 


767-460-40 


Sequence 40, Appl 


2 


28 


57.1 


12 


10 


us- 


09- 


832-723-78 


Sequence 78, Appl 


3 


27 


55.1 


2.5 


10 


us- 


09- 


864-761-40831 


Sequence 4 0831, A 



4 


26 


53 . 


1 


17 


9 


US-09"y9 6- 634-00 


Secjuence 88^ Appl 


5 


26 


53 . 


1 


17 


9 


US- 09- 9 9 6- 63 4- 8 9 


Sequence 89, Appl 


6 


26 


53 • 


1 


17 


9 


US— 09-*9y /-lol — oo 


Sequence 88, Appl 


7 


26 


53 . 


1 


17 


9 


US-09-' 997-18 1 — 8 9 


Sequence 89, Appl 


8 


26 


53 . 


1 


17 


9 


US- 09- 997-1 82— 8 8 


Sequence 88, Appl 


9 


26 


53 . 


1 


17 


9 


US-09-997-lo2-8 9 


Sequence 89, Appl 


10 


26 


53 . 


1 


25 


9 


US- 10- 0 0 1-8 7 0-12 6 


Sequence 12 6, App 


11 


25 


51 . 


0 


7 


10 


US-09-2 05-558-2 30 


Sequence 230, App 


12 


25 


51 . 


0 


8 


9 


US- 09- 8 4 9- 092 -2 


Sequence 2, Appli 


13 


25 


51 . 


0 


10 


10 


US-09-767-4 60-8 8 


Sequence 88, Appl 


14 


25 


51 . 


0 


15 


10 


US-09-7 91-17 1-171 


Sequence 171, App 


15 


25 


51 . 


0 


20 


9 


US- 10- 19 6-7 03-11 


Sequence 11, Appl 


16 


24 


49 , 


0 


9 


9 


US-0 9- 8 09-638-13 6 


Sequence 13 6, App 


17 


24 


49 . 


0 


9 


9 


US- 09- 8 09- 63 8 -2 6 4 


Sequence 264, App 


18 


24 


49 . 


0 


9 


9 


US- 09- 8 09- 638-530 


Sequence 530, App 


19 


24 


49 . 


0 


9 


9 


US- 09- 8 09-dJo-d4 4 


Sequence 64 4, App 


20 


24 


49 . 


0 


10 


9 


US-09- 0 U 9- Doo-loz 


Sequence 182, App 


21 


24 


49 . 


0 


10 


9 


US-09-809-O38-2 /4 


Sequence 27 4, App 


22 


24 


49 . 


0 


10 


9 


US- 09- 8 09- 638 -4 92 


Sequence 492, App 


23 


24 


49 . 


0 


10 


9 


US- 09- 8 09- 63 8 -5 8 0 


Sequence 580, App 


24 


24 


49 . 


0 


10 


9 


US-09-809-638-O86 


Sequence 58 6, App 


25 


24 


49 . 


0 


1 0 


9 


US- 09- 8 09- 63 8 -69 6 


Sequence 696, App 


26 


24 


49 . 


0 


10 


10 


US-Oy- /o/-4oU-ob 


Sequence 3 6, Appl 


27 


24 


49 . 


0 


13 


9 


US- 0 9- 8 8 0-7 4 8 -2 2 12 


Sequence ^AL^Lf Ap 


28 


24 


49 . 


0 


14 


9 


TTi~i r\ r\ ^ c\ c\ ooA TA 

US-09-7 9 8- 8 8 9-7 9 


Sequence 79, Appl 


29 


24 


49 . 


0 


15 


9 


US-10— Olz-oy b-9DO 


Sequence 965, App 


30 


24 


49 . 


0 


15 


9 


US— 0 9— 8 yo-oi4— y 00 


Sequence 965, App 


31 


24 


4 9 . 


0 


15 


10 


us-uy-yoo-ciu-oo 


Sequence 8 6, Appl 


32 


24 


49 . 


0 


15 


10 


US-09-953-510~8 / 


Sequence 87, Appl 


33 


24 


49 . 


0 


17 


10 


US- 0 9- 8 64- / 61-3o / 3 / 


Sequence oo/o/, a 


34 


24 


49 . 


0 


18 


9 


tt r\ r\ A"1A AAAtv '50A 

US-09-910-009A-389 


Sequence 389, App 


35 


24 


49 . 


0 


19 


9 


TT^ f\ f\ • AAA A C r\ -I 

US- 09- 9 09- 4 60-17 


Sequence 17, Appl 


36 


24 


49 . 


0 


19 


10 


US-0 9- 864-761-39331 


Sequence jjyool, A 


37 


24 


49 . 


0 


2 0 


8 


US- U o-4z 4-00UB- j4 1 


Sequence 341, App 


38 


24 


49. 


0 


23 


9 


US-09-809-638-719 


Sequence 719, App 


39 


24 


49. 


0 


25 


10 


US-09-8 64-7 61-4 4695 


Sequence 44695, A 


40 


24 


49. 


0 


25 


10 


US-09-864-761-45769 


Sequence 45769, A 


41 


23 


46. 


9 


5 


10 


US-09-870-379-5 


Sequence 5, Appli 


42 


23 


46. 


9 


7 


9 


US-09-229~75lA-21 


Sequence 21, Appl 


43 


23 


46. 


9 


9 


9 


US-10~046-922-38 


Sequence 38, Appl 


44 


23 


46. 


9 


13 


9 


US-10-097-175-67 


Sequence 67, Appl 


45 


23 


46. 


9 


16 


9 


US-09-750-754-20 


Sequence 20, Appl 



ALIGNMENTS 



RESULT 1 

US-09-767-460-40 

; Sequence 40, Application US/09767460 

; Patent No, US2 0020009756A1 

; GENERAL INFORMATION: 

; APPLICANT: Mandell., Arnold 

; APPLICANT: Selz, Karen 

; APPLICANT: Shlesinger, Michael 

; TITLE OF INVENTION: Algorithmic Design of Peptides for Binding and/or 
Modulation of the 



; TITLE OF INVENTION: Functions of Receptors and/or Other Proteins 

; FILE REFERENCE: 01561-0002-CPUSOl 

; CURRENT APPLICATION NUMBER: US/09/767,460 

; CURRENT FILING DATE: 2001-01-23 

; NUMBER OF SEQ ID NOS : 96 

; SOFTWARE: Patentin version 3.0 

; SEQ ID NO 40 

; LENGTH: 10 

; TYPE: PRT 

; ORGANISM: Artificial Sequence 

; FEATURE : 

OTHER INFORMATION: Synthetic 
US-09-767-460-40 

Query Match 63.3%; Score 31; DB 10; Length 10; 

Best Local Similarity 66.7%; Pred. No, 10; 

Matches 4; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 3 NFSPNW 8 

I I : I I 
Db 3 NFTPEW 8 



RESULT 2 

US-09-832-723-78 . 

; Sequence 78, Application US/09832723 

; Patent No. US20020098524A1 

; GENERAL INFORMATION; 

; APPLICANT: Estell, David A. 

; APPLIC7\NT: Chen, Yiyou 

; APPLICANT: Murray, Christopher J. 

; APPLICANT: Tijerina, Pilar 

; TITLE OF INVENTION: METHODS FOR SELECTIVE TARGETING 
; FILE REFERENCE; GC617-2 

; CURRENT APPLICATION NUMBER: US/ 09/832 , 723 

; CURRENT FILING DATE: 2001-04-11 

; PRIOR APPLICATION NUMBER: US 60/197,259 

; PRIOR FILING DATE: 2000-04-14 
; NUMBER OF SEQ ID NOS: 117 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 78 
; LENGTH: 12 

TYPE: PRT 
; ORGANISM: Artificial Sequence 
; FEATURE : 

; OTHER INFOIMATION: peptides screened from a phage display random 

; OTHER INFORMATION: peptide library 

US-09-832-723-78 

Query Match 57.1%; Score 28; DB 10; Length 12; 

Best Local Similarity 66.7%; Pred. No. 42; 

Matches 4; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 3 NFSPNW 8 

I I I I 
Db 1 NFFPTW 6 



Search completed: April 11. 2003, 19:02:06 
Job time : 15 sees 



GenCore version 5 • 1 . 4_p5_457 8 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: April 11, 2003, 19:00:15 ; Search time 14 Seconds 

(without alignments) 
16.813 Million cell updates/sec 



Title: 

Perfect score : 
Sequence: 

Scoring table: 



PCT-US03-03800-20 
49 

1 ELNFSPNW 8 

BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 



Searched: 



262574 seqs, 29422922 residues 



Total number of hits satisfying chosen parameters: 



147762 



Minimum DB seq length: 0 
Maximum DB seq length: 25 



Post-processing 



Minimum Match 
Maximum Match 
Listing first 



0% 

100% 

45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/ l/iaa/5A_C0MB . pep : * 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/l/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/l/iaa/6B_COMB.pep: * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB.pep: ^ 

6: /cgn2_6/ptodata/l/iaa/backfilesl .pep: * 

Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


35 


71 


.4 


10 


2 


US- 


08-747-137-122 


Sequence 


122, App 


2 


33 


67 


. 3 


21 


1 


US- 


08-190-788A-14 


Sequence 


14, Appl 


3 


33 


67 


.3 


21 


1 


US- 


08-190-788A-253 


Sequence 


253, App 


4 


33 


67 


.3 


21 


1 


US- 


08~383-474B-20 


Sequence 


20, Appl 


5 


33 


67 


.3 


21 


1 


US- 


08-465-391A-14 


Sequence 


14, Appl 


6 


33 


67 


.3 


21 


1 


US- 


08-465-391A-253 


Sequence 


253, App 


7 


33 


67 


.3 


21 


2 


us- 


08-464-538B-14 


Sequence 


14, Appl 


8 


33 


67 


.3 


21 


2 


us- 


08-464-538B-253 


Sequence 


253, App 


9 


33 


67 


. 3 


21 


2 


us- 


08-463-076E-23 


Sequence 


23, Appl 


10 


33 


67 


.3 


21 


2 


us- 


08-463-076E-310 


Sequence 


310, App 


11 


31 


63 


. 3 


15 


2 


us- 


08-553-257A-50 


Sequence 


50, Appl 



12 


31 


63. 


3 


25 


1 


US-0 8-4 68-7 09b- 16 


Sequence 


16, Appl 


13 


31 


63. 


3 


25 


2 


US- 0 8-2 4 1- 6 64 B- 16 


Sequence 


16 1. Appl 


14 


28 


57 . 


1 


9 


4 


US- 09-64 4-600-21 


Sequence 


21, Appl 


15 


28 


57. 


1 


12 


2 


US- 08-934-222-51 


Sequence 


51;- Appl 


16 


28 


57 . 


1 


12 


2 


US- 08- 9 33-4 02- 51 


Sequence 


51, Appl 


17 


28 


57 . 


1 


12 


2 


US -09-207- 62 1-51 


Sequence 


51, Appl 


18 


28 


57 . 


1 


12 


2 


US-0 8-532- 8 18- 51 


Sequence 


51, Appl 


19 


28 


57. 


1 


12 


3 


US- 09-23 1-7 97-51 


Sequence 


51, Appl 


20 


28 


57 . 


1 


12 


3 


US- 08-93 4-22 4-51 


Sequence 


51, Appl 


21 


28 


57 . 


1 


12 


3 


US- 08-933-84 3-51 




51, Appl 


22 


28 


57 . 


1 


12 


4 


US-0 8- 9 34-22 3- 51 


Sequence 


51, Appl 


23 


28 


57 . 


1 


12 


4 


US-09-4 13-4 92-51 


Sequence 


51, Appl 


24 


28 


57 . 


1 


13 


2 


US- 08- 4 8 0-1 9 0-2 6 


Sequence 


26, Appl 


25 


28 


57 . 


1 


13 


2 


US- 08-4 8 8-379-26 


Sequence 


26, Appl 


26 


28 


57 . 


1 


13 


5 


PGT-US93-07545-2 6 


Sequence 


26, Appl 


27 


26 


53 . 


1 


10 


3 


US-08-159-339A-4 4 8 


Sequence 


4 4 8, App 


28 


26 


53 . 


1 


15 


2 


US-0 8- 5 53- 2 5 7A- 4 9 


Sequence 


4 9, Appl 


29 


26 


53 . 


1 


17 


4 


US- 0 8- 9 9 0-8 2 3- 8 8 


Sequence 


8 8, Appl 


30 


26 


53 . 


1 


17 


4 


US- 08- 9 9 0-82 3- 8 9 


Sequence 


89, Appl 


31 


26 


53 . 


1 


17 


4 


US-09-025-769B-2 61 


Sequence 


261, App 


32 


26 


53 . 


1 


20 


1 


US- 0 8 -2 00- 90 OA- 31 


Sequence 


31, Appl 


33 


26 


53 . 


1 


20 


5 


PCT-US94-0 061 6-31 


Sequence 


31, Appl 


34 


26 


53 . 


1 


25 


1 


US-08-4 68-7 09B-14 


Sequence 


14, Appl 


35 


26 


53. 


1 


25 


2 


US- 0 8-2 4 1- 6 64 B- 14 


Sequence 


14, Appl 


36 


25 


51 . 


0 


7 


4 


US- 09-25 8-754-4 02 


Sequence 


402, App 


37 


25 


51 . 


0 


7 


4 


US-09-042-107-402 


Sequence 


402, App 


38 


25 


51. 


0 


8 


1 


US-08-375~962B-2 


Sequence 


2, Appli 


39 


25 


51. 


0 


8 


2 


US-08-562-114B~2 


Sequence 


2, Appli 


40 


25 


51. 


0 


8 


4 


US-08-729-594A-2 


Sequence 


2, Appli 


41 


25 


51. 


0 


8 


4 


US-08-937-993-2 


Sequence 


2, Appli 


42 


25 


51. 


0 


10 


2 


US-08-934-222-74 


Sequence 


74, Appl 


43 


25 


51. 


0 


10 


2 


US-08-933-402-74 


Sequence 


74, Appl 


44 


25 


51. 


0 


10 


2 


US-09-207-621~74 


Sequence 


74, Appl 


45 


25 


51. 


0 


10 


2 


US-08-532~818-74 


Sequence 


74, Appl 



ALIGNMENTS 



RESULT 1 

US-08-747-137-122 

; Sequence 122, Application US/08747137 

; Patent No. 5945033 

; GENERAL INFORMATION: 

; APPLICANT: YEN, Richard C.K. 

; TITLE OF INVENTION: NON-CROSSLINKED PROTEIN PARTICLES FOR 

; TITLE OF INVENTION: THERAPEUTIC AND DIAGNOSTIC USE 

; NUMBER OF SEQUENCES: 184 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Townsend and Townsend and Crew LLP 

; STREET: Two Embarcadero Center, 8th Floor 

; CITY: San Francisco 

; STATE: CA 

; COUNTRY: USA 

; ZIP: 94111 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 



COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentin Release #1.0,. Version #1.30 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/747,137 

; FILING DATE: 12-NOV~1996 

CLASSIFICATION: 424 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER:. US 08/212,546 

; FILING DATE: 14-MAR-1994 

; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 08/069,831 

FILING DATE: 01~JUN-1993 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 07/959,560 

FILING DATE: 13-OCT-1992 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 07/641,720 

; FILING DATE: 15-JAN-1991 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Apple, Randolph T. 

; REGISTRATION NUMBER: 36,429 

; REFERENCE/DOCKET NUMBER: 016197-000840US 

; TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-576-0200 

; INFORMATION FOR SEQ ID NO: 122: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 10 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: not relevant 

; TOPOLOGY: not relevant 

FEATURE: 

; N7VME/KEY: Modif ied-site 

■; LOCATION: 1 

; OTHER INFORMATION: /product= "p-Glu" 

FEATURE: 

; NAME/KEY: - Modif ied-site 

; LOCATION: 10 

OTHER INFORMATION: /product= "Thr-Amide" 
US-08-747-137-122 

Query Match 71.4%; Score 35; DB 2; Length 10 

Best Local Similarity 71.4%; Pred. No. 1.4; 

Matches 5; Conservative 1; Mismatches 1; Indels 

Qy 2 LNFSPNW 8 

: I I I I I 
Db 2 VNFSPGW 8 



RESULT 2 

US-08-190-788A~14 

; Sequence 14, Application US/08190788A 

; Patent No. 5608035 

; GENERAL INFORMATION: 

; APPLICANT: Yanofsky, Stephen D. 

; APPLICANT: Barrett, Ronald W. 



APPLICANT: Baldwin, David N. 
; APPLICANT: Jacobs, Jeff W. 

; TITLE OF INVENTION: Peptides and Compounds That Bind to the 

; TITLE OF INVENTION: IL-1 Receptor 
; NUMBER OF SEQUENCES: 312 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Affymax Technologies N.V. 

; STREET: 4001 Miranda Avenue 

; CITY: Palo Alto 

; STATE: California 

COUNTRY: USA 
; ZIP: 94304 

; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/ 08/190 , 7 88A 

; FILING DATE: 02-FEB-1994 

CLASSIFICATION: 530 
; PRIOR APPLICATION DATA: 

; APPLICATION NUMBER: US 07/847,567 

; FILING DATE: 05-MAR-1992 

; ATTORNEY/AGENT INFORMATION: 
; NAME: Stevens, Lauren L. 

REGISTRATION NUMBER: 36,691 
; REFERENCE/DOCKET NUMBER: 1019.1 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-496-2300 

; TELEFAX: 415-424-0832 

; INFORMATION FOR SEQ ID NO: 14: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 21 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 

; MOLECULE TYPE: peptide 
US-08-190-788A-14 



Query Match 67.3%; 
Best Local Similarity 62,5%; 
Matches 5; Conservative 

Qy 



Score 33; DB 1; Length 21; 
Pred. No. 6.9; 
1; Mismatches 2; Indels 0; Gaps 0; 



1 ELNFSPNW 8 

I : I I I I 
1 ENTYSPNW 8 



Search completed; April 11, 2003, 19:01:44 
Job time : 14 sees 



GenCore version 5 . 1 . 4_p5_457 8 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on : 



Title; 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



April 11, 2003, 18:58:34 ; Search time 34 Seconds 

(without alignments) 
31.353 Million cell updates/sec 

PCT-US03-03800-20 
49 

1 ELNFSPNW 8 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

908470 seqs, 133250620 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 
Maximum DB seq length: 25 



320064 



Post-processing: 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

A Geneseq_101002:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 



/SIDS2/gcgdata/geneseq/ genes eqp 
/ SIDS2/gcgdata/ genes eq/ genes eqp 
/ S IDS 2 /gcgdata/ genes eq/ genes eqp 
/SIDS2/gcgdata/ gene seq/ genes eqp 
/SIDS2 /gcgdata/ genes eq/ genes eqp- 
/S IDS 2/ gcgdata/ geneseq/ genes eqp 
/SIDS2/gcgdata/geneseq/ geneseqp- 
/SIDS2/gcgdata/geneseq/geneseqp 
/SIDS2 /gcgdata/ geneseq/ genes eqp 
/SIDS2/gcgdata/ geneseq/geneseqp 
/SIDS2 /gcgdata/ genes eq/geneseqp 
/SIDS2/ gcgdata/ geneseq/geneseqp 
/SIDS2/ gcgdata/geneseq/ geneseqp 
/ SIDS2/gcgdata/geneseq/ geneseqp 
/ SI DS2/ gcgdata/ genes eq/geneseqp 
/SIDS2/ gcgdata/geneseq/ geneseqp 
/SIDS2 /gcgdata/ geneseq/ genes eqp 
/SIDS2 /gcgdata/ geneseq/ genes eqp 
/SIDS2 /gcgdata/ genes eq/geneseqp 
/SIDS2/gcgdata/geneseq/ geneseqp 
/SIDS2 /gcgdata/ geneseq/ genes eqp 
/SIDS2/gcgdata/geneseq/geneseqp 
/SIDS2/gcgdata/ geneseq/geneseqp 



embl/AA1980.DAT: 
embl/AA1981.DAT: 
embl /]\A1 982. DAT : 
embl/AA1983 , DAT: 
embl/AA1984 . DAT: 
erabl/AA1985.DAT: 
embl/AA1986.DAT: 
embl/AA1987.DAT: 
erabl/AA198 8 . DAT: 
embl/AA198 9.DAT 
embl/ AA1990.DAT 
embl /AAl 9 91. DAT 
embl/7\Al992 .DAT 
.-embl/AAl9 93 .DAT 
embl /AAl 9 9 4. DAT 
embl /AAl 9 9 5. DAT 
embl/AA1996.DAT 
embl /AAl 9 9 7. DAT 
embl/AA199 8 .DAT 
-embl/AA1999 . DAT 
•einbl/AA2000.DAT 
•embl/AA2001.DAT 
-embl/7\A2002 . DAT 



Pred. Mo. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 



Lit 

fo. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


41 


83.7 


11 


22 


AAB92383 


Miscellaneous pept 


2 


40 


81.6 


10 


20 


AAY31073 


Non-crosslinked pr 


3 


33 


67.3 


21 


16 


AAR90518 


Interleukin-1 type 


4 


33 


67.3 


21 


18 


AAW16220 


Peptide containing 


5 


33 


67.3 


21 


18 


AAW15976 


Interleukin-1 type 


6 


33 


67.3 


21 


19 


AAW68853 


Peptide binding in 


7 


33 


67.3 


21 


19 


AAW68614 


Peptide binding in 


8 


33 


67.3 


21 


19 


AAW58086 


. Interleukin-1 type 


9 


33 


67.3 


21 


20 


AAY09989 


Interleukin-1 type 


10 


33 


67.3 


21 


20 


AAY09706 


Interleukin-1 type 


11 


33 


67.3 


21 


21 


AAB17160 


IL-1 antagonist pe 


12 


33 


67.3 


21 


21 


AAB17769 


IL~1 antagonist pe 


13 


33 


67.3 


21 


21 


AAB17817 


IL-1 antagonist pe 


14 


33 


67.3 


21 


23 


ABB72407 


Interleukin 1 anta 


15 


33 


67.3 


21 


23 


ABB72665 


Interleukin 1 anta 


16 


33 


67.3 


21 


23 


ABB72708 


Interleukin 1 anta 


17 


31 


63.3 


10 


23 


AAU82819 


Human Calcitonin t 


18 


31 


63.3 


15 


16 


AAR638 81 


Human tumour al pro 


19 


29 


59.2 


9 


19 


AAW83225 


NPF motif EH domai 


20 


28 


57.1 


5 


13 


AAR26473 


Serotinin release 


21 


28 


57.1 


9 


22 


AAB98511 


Human TADG-15 pept 


22 


28 


57.1 


12 


15 


AAR67058 


Immunomodulator pe 


23 


28 


57, 1 


12 


23 


ABB05292 


Collar soils bindi 


24 


28 


57.1 


13 


15 


AAR49316 


Beta2m position 52 


25 


28 


57.1 


23 


22 


AAU04076 


Human partial Beta 


26 


28 


57.1 


23 


22 


AAU04 077 


Human partial Beta 


27 


28 


57.1 


25 


22 


AAB69564 


Human Repro-EN-1.0 


28 


27 


55.1 


7 


19 


AAW65273 


Cysteine analogue 


29 


27 


55.1 


7 


19 


7^AW52003 


Peptide having imm 


30 


27 


55.1 


7 


19 


AAW51932 


Peptide #34 having 


31 


27 


55. 1 


7 


19 


AAW51053 


Penicillamine cent 


32 


27 


55.1 


12 


22 


AAB60035 


Internalising pept 


33 


27 


55.1 


23 


22 


AAU04078 


Human partial Beta 


34 


27 


55.1 


25 


22 


ABB418 08 


Peptide #9314 enco 


35 


27 


55.1 


25 


22 


ABB25533 


Protein #7532 enco 


36 


27 


55.1 


25 


22 


7^62680 


Human brain expres 


37 


27 


55.1 


25 


22 


7W475498 


Human bone marrow 


38 


27 


55.1 


25 


22 


AAM20613 


Peptide #7047 enco 


39 


27 


55.1 


25 


22 


AAM35604 


Peptide #9641 enco 


40 


27 


55.1 


25 


23 


ABG45057 


Human peptide enco 


41 


26 


53.1 


7. 


18 


AAW45058 


Immunomodulatory p 


42 


26 


53.1 


7 


19 


AAW51985 


Peptide having imm 


43 


26 


53.1 


7 


19 


AAW51136 


Methionine contain 


44 


26 


53 . 1 


7 


20 


AAY09470 


Immunoactive pepti 


45 


26 


53.1 


10 


15 


AAY38149 


Hepatitis B virus- 



ALIGNMENTS 



RESULT 1 



AAB92383 

ID AAB92383 standard; Peptide; 11 AA. 
XX 

AC AAB92383; 

XX 

DT 22-JUN-2001 (first entry) 
XX 

DE Miscellaneous peptide SEQ ID NO: 1559. 

XX 

KW Protection; endogenous therapeutic peptide; peptidase; conjugation; 

?CW blood component; modification; succinimidyl ; maleimido group; amino; 

KW hydroxyl; thiol; hormone; growth factor; neurotransmitter. 
XX 

OS Homo sapiens . 

OS Synthetic. 

XX 

PN W02 00 06990 0-A2. 
XX 

PD 23-NOV-2000. 
XX 

PF 17-MAY-2000; 2000WO-US13576 , 

XX 

PR 17-MAY~1999; 99US-0134406. 

PR lO-SEP-1999; 99US-0153406 . 

PR 15-OCT-1999; 99US-01597 83 . 
XX 

PA (CONJ-) CONJUCHEM INC. 
XX 

PI Bridon DP, Ezrin AM, Milner PG, Holmes DL, Thibaudeau K; 
XX 

DR WPI; 2001-112059/12. 

XX 

PT Modifying and attaching therapeutic peptides to albumin prevents 

PT peptidase degradation, useful for increasing length of in vivo activity 

PT 

XX 

PS Disclosure; Page 714; 733pp; English. 
XX 

CC The present invention describes a modified therapeutic peptide (I) 

CC comprising a therapeutically active amino acid region (III) and a 

CC reactive group (II) (e.g. succinimidyl and maleimido groups) attached to 

CC a less therapeutically active amino acid region (IV) , which covalently 

CC bonds with amino/hydroxyl/thiol groups on blood components to form a 

CC peptidase stabilised therapeutic peptide composed of 3-50 amino acids. 

CC (I) are useful for modifying therapeutic peptides e.g. hormones, growth 

CC factors and neurotransmitters, to protect them from peptidase activity 

CC in vivo for the treatment of various disorders. Endogenous therapeutic 

CC peptides are not suitable as drug candidates as they require frequent 

CC administration due to rapid degradation by peptidases in the body. 

CC Modifying and attaching therapeutic peptides to albumin prevents or 

CC reduces the action of peptidases to increase length of activity (half 

CC life) and specificity as bonding to large molecules decreases 

CC intracellular uptake and interference with physiological processes. 

CC AAB90829 to AAB92441 represent peptides which can be used in the 

CC exemplification of the present invention. 

XX 

SQ Sequence 11 AA; 



Query Match 83.7%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 41; DB 22; Length 11; 
Pred. No. 0.15; 
1; Mismatches 0; Indels 0; Gaps 



Qy 2 LNFSPNW 8 

I I I : I I I 
Db 2 LNFTPNW 8 



RESULT 3 
AAR90518 

ID AAR90518 standard; peptide; 21 AA. 
XX 

AC AAR90518; 

XX 

DT 14-MAR-1996 (first entry) 
XX 

DE Interleukin-1 type I receptor binding peptide #4. 
XX 

KW Interleukin-1 type I receptor; IL~1; IL-lRtI; atherosclerosis; 

KW rheumatoid arthritis; osteoporosis; HIV; AIDS; bacterial infection; 

KW respiratory distress syndrome; acute myelogenous leukaemia; 

KW coal miner pneumococcus ; alcoholic cirrhosis; cuprophane haemodyalisis ; 

KW cardiopulmonary bypass; chronic hepatitis B; thermal injury; 

KW reticulohistiocytosis ; sarcoidosis; tuberculosis; obstructive jaundice; 

KW Paget *s disease; osteomalacia; IDDM; Kawasaki's disease; 

KW inflammatory bowel disease; sepsis; toxic shock; luteal phase; therapy. 

XX . 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Region 13.. 20 

FT /note= "core sequence #1" 
XX 

PN WO9520973-A1. 
XX 

PD lO-AUG-1995. 

XX 

PF Ol-FEB-1995; 95WO-US01590 . 
XX 

PR 02-FEB-1994; 94US-0190788 . 
XX 

PA {AFFY-) AFFYMAX TECHNOLOGIES NV. 
XX 

PI Baldwin D, Barrett RW^ Jacobs JW, Yanofsky SD; 
XX 

DR WPI; 1995-283605/37. 
XX 

PT Interleukin-1 type I receptor binding compounds - used e.g. in the 

PT treatment of osteoporosis, HIV and hepatitis B 

XX 

PS Claim 22; Page 53; 56pp; English. 
XX 

CC The sequences represented by R909515--AAR90527 are interleukin-1 type I 

CC receptor (IL~lRtI) binding peptides. These peptides contain the core 

CC sequence represented by AAR83759. These sequences block the binding of 



CC IL-1 to IL-lRtl. The sequences were synthesised by using solid phase 

CC synthesis. These sequences are useful in vitro for studying the IL-1 

CC receptor binding process, for developing and assaying other compounds 

CC which bind to the receptor, and for measuring the expression of IL-lRtI 

CC on cell surfaces. They can also be used to monitor the effectiveness of 

CC treatments which influence IL-1 production. They could also be used for 

CC treating disorders which are susceptible to treatment with an IL-1 

CC inhibitor, e.g. atherosclerosis, rheumatoid arthritis, osteoporosis, 

CC HIV, AIDS, bacterial infection, respiratory distress syndrome, acute 

CC myelogenous leukaemia, coal miner pneumococcus, graft vs. host disease, 

CC alcoholic cirrhosis, cuprophane haemodyalisis , cardiopulmonary bypass, 

CC chronic hepatitis B, thermal injury, reticulohistiocytosis , sarcoidosis, 

CC tuberculosis, obstructive jaundice, Paget ' s disease, osteomalacia, IDDM, 

CC Kawasaki's disease, inflammatory bowel disease, sepsis, toxic shock and 

CC luteal phase. These compounds may also be conjugated so that they act 

CC as antagonists, or agonists, of IL-lRtI and may be used to direct a 

CC cytotoxic or therapeutic agent to a cell expressing this receptor. 
XX 

SQ Sequence 21 AA; 

Query Match 67.3%; Score 33; DB 16; Length 21; 

Best Local Similarity 62.5%; Pred. No. 12; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 ELNFSPNW 8 

I : I I I I 
Db 1 ENTYSPNW 8 



RESULT 14 
ABB72407 

ID ABB72407 standard; Peptide; 21 AA, 
XX 

AC ABB72407; 

XX 

DT 05-APR-2002 (first entry) 
XX 

DE Interleukin 1 antagonist peptide SEQ ID NO: 216. 

XX 

KW Modified peptide; mimetic; Fc domain; fusion; immunoglobulin G; IgG; 

KW EPO; erythropoietin; TPO; tumour necrosis factor alpha inhibitor; 

KW TNF-alpha inhibitor; interleukin 1 antagonist; IL-1 antagonist; TMP; 

KW TPO mimetic peptide; EPO mimetic peptide; EMP; VEGF antagonist; 

KW MMP inhibitor; antiinflammatory; antitumour; immunosuppressive; 

KW cytostatic; antirheumatic; antiarthritic; antidiabetic; ophthalmological ; 

KW antianaemic; anorectic; antiinf ertility; haemostatic; dermatological; 

KW neuroprotective; inflammatory disease; autoimmune disease; tumour growth; 

KW cancer; rheumatoid arthritis; diabetic retinopathy; infertility; obesity; 

KW sleep disorder; neurological degenerative disease; anaemia; 

KW thrombocytopaenia; metastatic tumour; systemic lupus erythematosus; 

KW Fanconi's syndrome. 

XX 

OS Homo sapiens. 

OS Synthetic. 

XX 

PN WO200183525-A2 . 
XX 

PD 08-NOV-2001. 



XX 

PF 02-MAY-2001; 2001WO-US14310 . 
XX 

PR 03-MAy-2000; 2000US-0563286 • 

XX 

PA (AMGE-) AMGEN INC. 
XX 

PI Feige U, Liu C, Cheetham JC, Boone TC, Gudas JM; 
XX 

DR WPI; 2002-130313/17. 

XX 

PT Novel vehicle-peptide molecule or its multimers useful for treating 

PT inflammatory and autoimmune diseases, cancer;, rheumatoid arthritis, 

PT diabetic retinopathy, obesity, sleep disorders and infertility - 
XX 

PS Claim 39; Page 32; 176pp; English. 
XX 

CC The present invention describes a vehicle-peptide molecule (I) or its 

CC multimers, (I) can have antiinflammatory, antitumour, immunosuppressive, 

CC cytostatic, antirheumatic, antiarthritic, antidiabetic, ophthalmological, 

CC antlanaemic, anorectic, antiinf ertility, haemostatic, dermatological and 

CC neuroprotective activities. (I) can be used as a therapeutic or 

CC prophylactic agent as well as for screening purposes. (I) is useful for 

CC diagnosing diseases characterised by dysfunction of their associated 

CC protein of interest, for identifying normal or abnormal proteins of 

CC interest, as a part of diagnostic kit to detect the presence of their 

CC proteins of interest in a biological sample. Additionally, (I) is useful 

CC for treating inflammatory and autoimmune diseases, tumour growth, cancer, 

CC rheumatoid arthritis, diabetic retinopathy, obesity, sleep disorders, 

CC infertility, and neurological degenerative diseases. (I), comprising 

CC EPO-mimetic compounds are useful for treating disorders characterised by 

CC low red blood cell levels such as anaemia. The TPO-mimetic comprising 

CC compounds are useful for treating conditions that involve an existing 

CC megakaryocyte/platelet deficiency or an expected megakaryocyte/platelet 

CC deficiency, such as thrombocytopaenia, aplastic anaemia, metastatic 

CC tumour which result in thrombocytopaenia, systemic lupus erythematosus, 

CC and Fanconi * s syndrome. ABB72403 to ABB73426 and ABL35695 to ABL35777 

CC represent amino acid and nucleic acid sequences used in the 

CC exemplification of the present invention, 

XX 

SQ Sequence 21 AA; 

Query Match 67.3%; Score 33; DB 23; Length 21; 
Best Local Similarity 62.5%; Pred. No. 12; 

Matches 5; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 ELNFSPNW 8 

I : M I I 

Db 1 ENTYSPNW 8 
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ACCESSION NUMBER: 
TITLE: 



INVENTOR (S) : 



PATENT ASSIGNEE (S) 
SOURCE : 



DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2003:717612 HCAPLUS 

Protein and cDNA sequences of a novel human G 
protein-coupled receptor hgprbmy6, expressed highly in 
small intestine 

Feder, John N.; Mintier, Gabriel; Ramanathan, Chandra 
S.; Hawken, Donald R. ; Cacace, Angela M. ; Barber, 
Lauren E.; Kornacker, Michael G,; Nelson, Thomas C. ; 
Bol, David K. 
USA 

U.S. Pat. Appl. Publ., 99 pp., Cont . -in-part of U.S. 

Ser. No. 966,422. 

CODEN: USXXCO 

Patent 

English 

2 



PATENT NO. 



KIND DATE 



US 2003170671 Al 20030911 

US 2003044892 Al 20030306 

PRIORITY APPLN. INFO.: 



APPLICATION NO. DATE 

US 2002-262272 20020927 

US 2001-966422 20010926 

US 2000-235602P P 20000927 

US 2001-306604P P 20010719 

US 2001-315412P P 20010828 
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US 2001-966422 A2 20010926 
AB The present invention provides protein and cDNA sequences of a newly 

discovered human G-protein coupled receptor hgprbmy6 that expresses highly 
in small intestine. Also described are expression vectors, host cells, 
agonists, antagonists, antisense mols., and antibodies assocd. with the 
polynucleotide and/or polypeptide of the present invention. In addn., 
methods for treating, diagnosing, preventing, and screening for disorders 
assocd. with aberrant cell growth and diseases or disorders related to the 
small intestine, 
IT 408305-90-8 

RL: PRP (Properties) 

(unclaimed sequence; protein and cDNA sequences of a novel human G 
protein-coupled receptor hgprbmy6, expressed highly in small intestine) 
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ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 

INVENTOR (S) : 



PATENT ASSIGNEE (S) 
SOURCE : 



DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2003:570453 HCAPLUS 
139:132437 

Lung cancer antigens and cDNAs encoding them and their 
diagnostic, prophylactic and therapeutic uses 
Mericle, Barbara; Fanger, Gary R. ; Vedvick, Thomas S,; 
Carter, Darrick; Watanabe, Yoshihiro; Henderson, 
Robert A.; Kalos, Michael D.; Spies, A. Gregory; Foy, 
Teresa M. ; Fan, Liqun; Wang, Tongtong 
Corixa Corporation, USA 

U.S. Pat. Appl. Publ., 75 pp., Cont . -in-part of U.S. 

Ser. No. 7,700. 

CODEN: USXXCO 

Patent 

English 

14 



PATENT NO. 


KIND 


DATE 


US 


2003138438 


Al 


20030724 


US 


6312695 


Bl 


20011106 


us 


2003119763 


Al 


20030626 


us 


6482597 


Bl 


20021119 


us 


6518256 


Bl 


20030211 


us 


6531315 


Bl 


20030311 


us 


6426072 


Bl 


20020730 


us 


2002052329 


Al 


20020502 


us 


2002115139 


Al 


20020822 


us 


2002147143 


Al 


20021010 


us 


2003064947 


Al 


20030403 



APPLICATION NO. 



DATE 



PRIORITY APPLN. INFO. 



US 
US 

us 
us 
us 
us 
us 
us 
us 
us 
us 

us 1998 
US 1998 
us 1998 
US 1999 
US 1999 
US 1999 
US 2000 
US 2000 
US 2000 
US 2000 
US 2000 
US 2000 
US 2000 
US 2000 
US 2000 
US 2001 
US 2001 
US 2001 



002-117982 

998- 123912 

999- 466396 
000-480884 
000-542615 
000-606421 
000-643597 

000- 735705 

001- 850716 
001-897778 
001-7700 

40802 

123912 

221107 
-285479 

466396 
-476496 

480884 
-510376 

542615 
-606421 
-630940 
-643597 
-662786 
-685696 

735705 

850716 
-897778 

7700 



B2 
A2 
A2 
A2 
A2 
A2 
A2 
A2 
A2 
A2 
A2 
A2 
B2 
A2 
A2 
A2 
A2 
A2 



20020405 
19980727 
19991217 
20000110 
20000404 
20000628 
20000821 
20001212 
20010507 
20010628 
20011130 
19980318 
19980727 
19981222 
19990402 
19991217 
19991230 
20000110 
20000222 
20000404 
20000628 
20000802 
20000821 
20000915 
20001009 
20001212 
20010507 
20010628 
20011130 
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WO 1999-US5798 Al 19990317 
AB Lung cancer-specific antigens are identified and characterized and cDNAs 
encoding a no, of them are cloned and characterized. The antigens, or 
epitopes derived from them may be useful in the diagnosis, prevention, or 
treatment of lung cancer. Characterization of a no. of antigens, 
including tissue distribution and the identification of T cell epitopes is 
demonstrated. 
IT 387817-80-3 

RL: PRP (Properties) 

(unclaimed sequence; lung cancer antigens and cDNAs encoding them and 
their diagnostic, prophylactic and therapeutic uses). 
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INVENTOR (S) : 
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SOURCE : 
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2003:491266 HCAPLUS 
139: 67789 

C-C chemokine antagonistic mutants for treating 
autoimmune and inflammatory diseases, cancers and 
infections 

Proudfoot, Amanda; Kosco-Vilbois, Marie; Shaw, Jeffrey 
Applied Research Systems Ars Holding N.V., Neth. 



PCT Int. Appl. 

CODEN: PIXXD2 

Patent 

English 

1 



59 pp. 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



WO 2003051921 


Al 


20030626 




WO 2002-EP14325 


20021216 
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MC, 


NL, 


PT, 
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TR, 
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BJ, 


CF, 


CG, CI, 


CM, 


GA, 


GN, 


GQ, 


GW, 


ML, 


MR, 


NE, 


SN, 


TD, 


TG 






















PRIORITY APPLN. 


INFO, 












EP 2001-761 




A 


20011217 







AB 



IT 



Mutants of specific CC-chemokines contg. a non-conservative substitution 

in a conserved consensus sequence act as CC-chemokine antagonists, and can 

be effectively used in the treatment of autoimmune and inflammatory 

diseases, cancers, and viral or bacterial infections. Particularly 

preferred are the RANTES/CCL5 mutants. 

549494-95-3 

RL: PRP (Properties) 

(unclaimed sequence; c-C chemokine antagonistic mutants for treating 
autoimmune and inflammatory diseases, cancers and infections) 



REFERENCE COUNT: 



8 THERE ARE 8 CITED REFERENCES AVAILABLE FOR THIS 

RECORD. ALL CITATIONS AVAILABLE IN THE RE FORMAT 
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TITLE: 



INVENTOR (S) 



HCAPLUS COPYRIGHT 2003 ACS on STN 
2003:356709 HCAPLUS 
138:362621 

Liver response-associated protein isoforms as 
biomarkers of liver response to exogenous agents, 
including drugs, and to disease 

Amacher, David E.; Fasulo, Lisa M. ; Herath, Herath 
Mudiyanselage Athua Chandrasiri; Holt, Gordon Duane; 
Stiger, Thomas R. 



Page 3 



Russel 10 073419 



PATENT ASSIGNEE (S): 

SOURCE: 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT; 
PATENT INFORMATION: 



Pfizer Products Inc., USA; Oxford Glycosciences (UK) 
Ltd. 

PCT Int. Appl., 256 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



WO 2003038444 


A2 


20030508 




WO 2002~US34847 


20021031 






W: AE, 


AG, 


AL, 


AM, 


AT, 


AU, 
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BA, 
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PT, 


RO, 


RU, 
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GR, 
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TR, 
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BJ, 


CF, 
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CI, 
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GQ, 


GW, 


ML, 


MR, 


NE, 


SN, 


TD, 
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PRIORITY APPLN. 


INFO, 












US 2001- 


335964P 


P 


20011031 







AB The present invention provides methods and compns . for screening, 

diagnosing and prognosing liver response, for monitoring the effectiveness 
of liver response to treatment, and for drug development. The method 
involves testing a biol . sample (blood, serum, plasma, urine, liver 
tissue) for Liver Response-Assocd . Protein Isoforms (LRPIs) . Antibodies 
or oligonucleotide probes can be used to detect LRPIs. A kit is also 
claimed comprising antibodies, other reagents, and instructions. 
IT 521939-37-7 

RL: PRP (Properties) 

(unclaimed sequence; liver response-assocd. protein isoforms as 
biomarkers of liver response to exogenous agents, including drugs, and 
to disease) 
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ACCESSION NUMBER: 
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INVENTOR (S) : 



PATENT ASSIGNEE (S): 
SOURCE : 



DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2003:261014 HCAPLUS 
138:302631 

Lung carcinoma-derived polypeptides, polynucleotides, 
probes and primers, and antibodies for cancer therapy 
and diagnosis 

Wang, Tongtong; Wang, Aijun; Skeiky, Yasir A. W.; Li, 
Samuel X.; Kalos, Michael D.; Henderson, Robert A.; 
McNeill, Patricia D. ; Fanger, Neil; Retter, Marc W.; 
Durham, Margarita; Fanger, Gary R.; Vedvick, Thomas 
S.; Carter, Darrick; Watanabe, Yoshihiro; Peckham, 
David W.; Cai, Feng; Foy, Teresa M. 
Corixa Corporation, USA 

U.S. Pat. Appl. Publ., 296 pp., Cont . -in-part of U.S. 

Ser. No. 897,778. 

CODEN: USXXCO 

Patent 

English 

14 



PATENT NO. 



US 2003064947 
US 6312695 
US 2003119763 



KIND DATE 



Al 20030403 
Bl 20011106 
Al 20030626 



APPLICATION NO. DATE 



US 2001-7700 
US 1998-123912 
US 1999-466396 



20011130 
19980727 
19991217 
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as 


6482597 


Bl 


20021119 


us 


6518256 


Bl 


20030211 


us 


6531315 


Bl 


20030311 


us 


6426072 


Bl 


20020730 


us 


2002052329 


Al 


20020502 


us 


2002115139 


Al 


20020822 


us 


2002147143 


Al 


20021010 


us 


2003138438 


Al 


20030724 



PRIORITY APPLN. INFO, 



AB 



IT 



us 2000-480884 20000110 
US 2000-542615 20000404 
US 2000-606421 20000628 
us 2000-643597 20000821 
US 2000-735705 20001212 
US 2001-850716 20010507 
US 2001-897778 20010628 
US 2002-117982 20020405 
US 1998-40802 A2 19980318 
US 1998-123912 A2 19980727 
US 1998-221107 A2 19981222 
US 1999-285479 A2 19990402 
US 1999-466396 A2 19991217 
US 1999-476496 A2 19991230 
US 2000-480884 A2 20000110 
US 2000-510376 A2 20000222 
US 2000-542615 A2 20000404 
US 2000-606421 A2 20000628 
US 2000-630940 A2 20000802 
US 2000-643597 A2 20000821 
US 2000-662786 A2 20000915 
US 2000-685696 A2 20001009 
US 2000-735705 A2 20001212 
US 2001-850716 A2 20010507 
US 2001-897778 A2 20010628 
WO 1999-US5798 Al 19990317 
US 2001-7700 A2 20011130 

Compns . and methods for the therapy and diagnosis of cancer, particularly 
lung cancer, are disclosed. Illustrative compns. comprise one or more 
lung tumor polypeptides, immunogenic portions thereof, polynucleotides 
that encode such polypeptides, antigen presenting cell that expresses such 
polypeptides, and T cells that are specific for cells expressing such 
polypeptides. The lung tumor antigens and encoding polynucleotides are 
isolated and characterized from human lung squamous cell carcinoma cDNA 
expression library by PCR-based subtraction, and human monoclonal 
antibodies and hybridomas are generated from transgenic mice. 
387817-80-3 

RL: BSU (Biological study, unclassified) ; DGN {Diagnostic use) ; PRP 
(Properties); THU (Therapeutic use); BIOL (Biological study); USES (Uses) 
(lung carcinoma-derived polypeptides, polynucleotides, probes and 
primers, and antibodies for cancer therapy and diagnosis) 
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2003:97435 HCAPLUS 
138: 149946 

Production of acylated polypeptides by recombinant 

expression of precursor proteins followed by acylation 

at the lysine . epsilon . -amino groups and proteolytic 

cleavage of the N-terminal extension 

Diers, Ivan; Balschmidt, Per; Markussen, Jan; 

Jonassen, lb; Egel-Mitani, Michi; Kjeldsen, Thomas 

Borglum 

Novo Nordisk A/S, Den. 

PCT Int, Appl., 42 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT NO. 
WO 2003010186 



KIND DATE 
A2 20030206 



APPLICATION NO. DATE 

WO 2002-DK502 20020718 



Page 5 



Russel 10 073419 



W: 



RW: 



AE, 


AG, 


AL, 


AM, 


AT . 


AU, 


AZ , 


BA, 


BB, 


BG, 


BR, 


BY, 


BZ , 


CA, 


CH, 


CN, 


CO, 


CR, 


CU, 


CZ, 


DE, 


DK, 


DM, 


DZ, 


EC, 


EE, 


ES, 


FI, 


GB, 


GD, 


GE, 


GH, 


GM, 


HR, 


HU, 


ID, 


IL, 


IN, 


IS, 


JP, 


KE, 


KG, 


KP, 


KR, 


KZ, 


LC, 


LK, 


LR, 


LS, 


LT, 


LU, 


LV, 


MA, 


MD, 


MG, 


MK, 


MN, 


MW, 


MX, 


MZ, 


NO, 


NZ, 


OM, 


PH, 


PL. 


PT, 


RO, 


RU, 


SD, 


SE, 


SG, 


SI, 


SK, 


SL, 


TJ, 


TM, 


TN, 


TR, 


TT, 


TZ, 


UA, 


UG, 


uz, 


VN, 


YU, 


ZA, 


ZM, 


zw. 


AM, 


AZ, 


BY, 


KG, 


KZ, 


MD, 


RU, 


TJ, TM 


GH, 


GM, 


KE, 


LS, 


MW, 


MZ, 


SD, 


SL, 


SZ, 


TZ, 


UG, 


ZM, 


ZW, 


AT, 


BE, 


BG, 


CH, 


CY, 


cz, 


DE, 


DK, 


EE, 


ES, 


FI, 


FR, 


GB, 


GR, 


IE, 


IT, 


LU, 


MC, 


NL, 


PT, 


SE, 


SK, 


TR, 


BF, 


BJ, 


CF, 


CG, 


CI, 


CM, 


GA, 


GN, 


GQ, 


GW, 


ML, 


MR, 


NE, 


SN, 


TD, 


TG 



























US 2003144471 
PRIORITY APPLN. INFO.: 



Al 20030731 



20020724 
20010724 
20010808 



OTHER SOURCE (S) : 
AB 



IT 



US 2002-205110 
DK 2001-1141 A 
US 2001-310793P P 
MARPAT 138:14994 6 

The present invention is related to a method of producing polypeptides in 
transformed host cells by expressing a precursor mol . of the desired 
polypeptide which is to be acylated at certain lysine , epsilon . -amino 
groups in a subsequent in vitro step. The N-terminal extensions allow for 
preferential acylation of the expressed precursor mol. and protects the 
expressed precursor mol. against proteolytic degrdn. within the host cell 
or in the culture medium. In addn., the precursor mol. is easier to 
purify and has a decreased tendency to form fibrils, thus allowing more 
flexibility when selecting down-stream sepn. and purifn. steps in large 
scale operations. The invention is also related to DNA sequences, 
vectors, and transformed host cells for use in the claimed method. 
Further, the present invention is related to certain precursors of the 
desired polypeptides and certain acylation methods. Thus 

EEAHK-Arg34 (glucagon-like peptide I) (7~37)-Lys26 . gamma . -Glu-hexadecanoyl 
is produced in 52% yield with acylation of the N-terminal extended 
GLP-l(7-37) with Glu (ONSu) N-hexadecanoyl Me ester in the presence of 2 
equiv of Zn2+ in CH3CN. 
494863-53-5 

RL: BUU (Biological use, unclassified); PRP (Properties); BIOL (Biological 

study) ; USES (Uses) 

(N~terminal extension; prodn. of acylated polypeptides by recombinant 
expression of precursor proteins followed by acylation at the lysine 
. epsilon . -amino groups and proteolytic cleavage of the N-terminal 

extension) 
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US 2001-283112P P 
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WO 2002-US11654 A 

AB Eighteen genes and their rasp, encoded proteins, and variants thereof, are 
described wherein the gene exhibits restricted expression in normal adult 
tissue and is overexpressed in various cancers. Suppression subtractive 
hybridization (SSH) is used to identify cDNAs corresponding to genes that 
are differentially expressed in cancer; PGR amplification, cloning, and 
sequencing of gene fragments from SSH yield the full-length cDNAs . 
Consequently, the gene products provide diagnostic, prognostic, 
prophylactic, and/or therapeutic targets for cancer. The genes or 
fragment thereof, their encoded proteins, or variants or fragments 
thereof, can be used to elicit a humoral or cellular immune response; 
antibodies or T cells reactive with the gene products can be used in 
active or passive immunization. [This abstr. record is one of 16 records 
for this document necessitated by the large no. of index entries required 
to fully index the document and publication system constraints.]. 

IT 473328-98-2 

RL: ANT (Analyte) ; BSU (Biological study, unclassified); DGN (Diagnostic 
use); PRP (Properties); THU (Therapeutic use); ANST (Analytical study); 
BIOL (Biological study); USES (Uses) 

(peptide epitope; human nucleic acids and corresponding proteins useful 

in the detection and treatment of various cancers) 
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The cDNAs and corresponding proteins corresponding to mRNAs differentially 
expressed in lung squamous cell carcinoma and lung adenocarcinoma are 
disclosed. Antibodies to the lung cancer-assocd. proteins and probes for 
lung cancer-assocd. nucleic acids may be used in diagnosis of lung cancer. 
The proteins and cDNAs, antibodies to the proteins, T cells specific for 
these tumor proteins, and antigen-presenting cells expressing an epitope 
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of these proteins may be used in treatment of lung cancer, e.g., in 
vaccines. Thus, many novel lung cancer-assocd . cDNAs/proteins were 
identified and their expression in normal and tumor tissues examd. Some 
of these cDNAs were expressed in E. coli, HEK293, and CHL-1 cells. N- and 
C-terminal fragments of one of the proteins were expressed as fusions with 
Mycobaceterium tuberculosis 32-kilodalton serine proteinase-derived 
peptide. CTL lines specific for one of the tumor antigens were generated 
by in vitro whole-gene priming. 
387817-80-3 
RL: PRP (Properties) 

(unclaimed sequence; lung cancer-assocd. cDNAs and proteins and their 

use in diagnosis and therapy) 
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PRIORITY APPLN. INFO.: US 2001-274676P P 20010309 

AB The present invention is directed to variants of antigens comprising 
folate binding protein epitopes as a compn. assocd. with providing 
immunity against a tumor in an individual. The variant is effective in 
inducing cytotoxic T-lymphocytes but preferably not to the extent that 
they become sensitive to silencing by elimination, such as by apoptosis, 
or by anergy, as in unresponsiveness. 
IT 460039-30-9 

RL: BSU (Biological study, unclassified); THU (Therapeutic use); BIOL 
(Biological study); USES (Uses) 

(cancer vaccines comprising variants of folate binding protein 
epitopes ) 
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the therapy and diagnosis of cancer, particularly lung cancer, are 
identified. Illustrative compns . comprise one or more lung tumor 
polypeptides, immunogenic portions thereof, polynucleotides that encode 
such polypeptides, antigen presenting cell that expresses such 
polypeptides, and T cells that are specific for cells expressing such 
polypeptides. The disclosed compns. are useful, for example, in the 
diagnosis, prevention and/or treatment of diseases, particularly lung 
cancer . 
387817-80-3 

RL: ANT (Analyte) ; BSU (Biological study, unclassified); BUU (Biological 
use, unclassified); DGN (Diagnostic use); PRP (Properties); THU 
(Therapeutic use); ANST (Analytical study); BIOL (Biological study); USES 
(Uses) 

(L523S peptide; differentially expressed sequences and proteins for use 
in therapy and diagnosis of human lung cancer) 
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AB The present invention relates to the use of cells or cell exts. contg. the 
F box protein Skp2 (S-phase kinase-assocd . protein 2), p27 (a cell 
cycle-regulated cyclin-dependent kinase inhibitor) or p27 C-terminal 
fragment, and Cksl (cyclin-dependent kinase subunit 1) for screening for 
potential therapeutic agents for the treatment of proliferative and 
dif f erentiative disorders, such as cancer. The action of the potential 
drug comprises modulation of the activity of Skp2, which may be detected 
as a change in interaction of Skp2 with p27 or Cksl, by a change in 
ubiquitination of p27, or by degrdn. of p27 or Cksl. 
IT 260388-05-4 

RL: PRP (Properties) 

(unclaimed sequence; cell or cell ext. contg. Skp2, p27, and Cksl for 
identification of compds . useful for treatment of proliferative and 
dif f erentiative disorders) 
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AB Compns, and methods for the therapy and diagnosis of cancer, particularly 
lung cancer, are disclosed. Illustrative compns . comprise one or more 
lung tumor polypeptides, immunogenic portions thereof, polynucleotides 
that encode such polypeptides, antigen presenting cell that expresses such 
polypeptides, and T cells that are specific for cells expressing such 
polypeptides. The disclosed compns. are useful, for example, in the 
diagnosis, prevention and/or treatment of diseases, particularly lung 
cancer . 

IT 387817-80-3 

RL: BSU (Biological study, unclassified); DGN (Diagnostic use); PRP 
(Properties); THU (Therapeutic use); BIOL (Biological study); USES (Uses) 
(lung tumor-specific antigen, chimeric antigens, polynucleotides, and 
antibodies for therapy and diagnosis of lung cancer) 
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AT, BE, CH, DE, 
IE, SI, LT, LV, 
PRIORITY APPLN, INFO.: 
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DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, PT, 
FI, RO, MK, CY, AL, TR 

US 2000-235602P P 20000927 
US 2001-306604P P 20010719 
US 2001-315412P P 20010828 
WO 2001-US30614 W 20010926 
The present invention describes a newly discovered human G-protein coupled 
receptor sequence homolog HGPRBMY6 and its encoding polynucleotide. The 
HGPRBMY6 is expressed highly in human small intestine and related to 
latrophilin, . alpha . -latrotoxin and CL3 receptors based on sequence 
similarity. The HGPRBMY6 can be used in drug screening, therapy and 
diagnosis of disorders related to over- and under-expression of HGPRBMY6. 
408305--90-8 
RL: PRP (Properties) 

(unclaimed sequence; cDNA and protein sequence of a novel human G 
protein-coupled receptor sequence homolog HGPRBMY6 and their uses in 
drug screening) 
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This invention provides proteins and fragments thereof derived from the 
bacteria Neisseria meningitidis serotype A, N. meningitidis serotype B, 
and N. gonorrhoeae, Th protein sequences disclosed in International 
Application patents WO 1999/57280 and WO 2000/22430 were subjected to 
computer anal, to predict antigenic peptide fragments, using three 
algorithms: AMPHI, ANTIGENIC INDEX, and HYDROPHOBICITY . Also provided are 
nucleic acids encoding for such proteins, polypeptides, and/or fragments, 
as well as nucleic acids complementary thereto (e.g., antisense nucleic 
acids). Addnl., this invention provides antibodies which bind to the 
proteins, polypeptides, and/or fragments. This invention further provides 
expression vectors useful for making the proteins, polypeptides, and/or 
fragments, as well as host cells transformed with such vectors. This 
invention also provides compns. of the protein fragments and/or nucleic 
acids for use as vaccines, diagnostic reagents, immunogenic compns., and 
the like, [This abstr. record is the sixth of 8 records for this document 
necessitated by the large no. of index entries required to fully index the 
document and publication system constraints.] 
375806-38-5 
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(Uses) 

(amino acid sequence; Neisseria meningitidis and N. gonorrhoeae 
antigens and the genes encoding them for use as vaccine and diagnostic 
compns . ) 
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AB Compns. and methods for the therapy and diagnosis of cancer, particularly 
lung cancer, are disclosed. Illustrative compns. comprise one or more^, 
lung tumor polypeptides, immunogenic portions thereof, polynucleotides 
that encode such polypeptides, antigen presenting cell that expresses such 
polypeptides, and T cells that are specific for cells expressing such 
polypeptides. The disclosed compns. are useful, for example, in the 
diagnosis, prevention and/or treatment of diseases, particularly lung 
cancer. 
IT 387817-80--3 

RL: PRP (Properties) 

(unclaimed sequence; compns. and methods for the therapy and diagnosis 
of lung cancer) 
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AB This invention provides proteins and fragments thereof derived from the 
bacteria Neisseria meningitidis serotype A, N. meningitidis serotype B, 
and N. gonorrhoeae. Th protein sequences disclosed in International 
Application patents WO 1999/57280 and WO 2000/22430 were subjected to 
computer anal, to predict antigenic peptide fragments, using three 
algorithms: AMPHI, ANTIGENIC INDEX, and HYDROPHOBICITY . Also provided are 
nucleic acids encoding for such proteins, polypeptides, and/or fragments, 
as well as nucleic acids complementary thereto (e.g., antisense nucleic 
acids). Addnl . , this invention provides antibodies which bind to the 
proteins, polypeptides, and/or fragments. This invention further provides 
expression vectors useful for making the proteins, polypeptides, and/or 
fragments, as well as host cells transformed with such vectors. This 
invention also provides compns. of the protein fragments and/or nucleic 
acids for use as vaccines, diagnostic reagents, immunogenic compns., and 
the like. [This abstr. is the fourth of 8 records for this codument 
necessitated by the large no. of index entries required to fully index the 
document and publication system constraints.] 

IT 375806-38-5 

RL: PRP (Properties); THU (Therapeutic use); BIOL (Biological study); USES 
(Uses) 

(amino acid sequence; Neisseria meningitidis and N, gonorrhoeae 
antigens and the genes encoding them for use as vaccine and diagnostic 
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Modified potato patatins are disclosed that maintain enzymic (lipid acyl 
hydrolase) and insecticidal activity while displaying reduced or 
eliminated allergenicity . Epitopes which bind to anti-patatin antibodies 
were identified, and removed via site directed mutagenesis. Tyrosines 
were obsd. to generally contribute to the allergenic properties of patatin 
proteins. Removal of glycosylation sites was obsd, to reduce or eliminate 
antibody binding. Permuteins are also disclosed which have a rearranged 
amino acid sequence while retaining enzymic activity. Deallerginized " 
patatins and permuteins can be used as insecticidal materials, as 
nutritional supplements, and as immunotherapeutic agents. 
350229-19-5 

RL: BSU {Biological study, unclassified) / BUU (Biological use, 
unclassified); PRP (Properties); BIOL (Biological study); USES (Uses) 

(prepn. of deallergenized patatins and permuteins and their use as 

insecticides ) 
350258-05-8 350258-06-9 350472-94-5 

350472- 97-8 

350473- 07-3 



350472-95-6 350472-96-7 

350472- 98-9 350473-05-1 

350473- 12-0 350473-13-^1 350473-16-4 
350473-71-1 

RL: PRP (Properties) 

(unclaimed sequence; prepn 



and their use as insecticides) 



of deallergenized patatins and permuteins 
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Improved lysosomal enzymes and lysosomal enzyme 

activators containing additional glycosylation sites 

introduced by site-specific mutagenesis 

Okkels, Jens Sigurd; Jensen, Anne Dam; Halkier, 

Torben; Jensen, Rikke Holding; Schambye, Hans 

Thalsgaard 

Maxygen Aps, Den. 

POT Int. Appl., 98 pp. 

CODEN: PIXXD2 

Patent 

English 

2 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



WO 2001049830 A2 20010712 

WO 2001049830 A3 20020207 

WO 2001049830 02 20021107 



WO 2000-DK743 



20001229 



W: 
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AG, 


AL, 
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AT, 


AU, 
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BB, 
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TJ, 
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RW: 


GH, 


GM, 


KE, 


LS, 


MW, 


MZ, 


SD, 


SL, 


sz. 


TZ, 


UG, 


zw. 


AT, 


BE, 


CH, 


CY, 




DE, 


DK, 
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FI, 


FR, 


GB, 


GR, 


IE, 


IT, 


LU, 


MC, 


NL, 


PT-, 


SE, 


TR, 


BF, 




BJ, 


CF, 


CG, 


CI, 


CM, 


GA, 


GN, 


GW, 


ML, 


MR, 


NE, 


SN, 


TD, 


TG 






>002127219 


Al 


20020912 




US 


; 2000-75312( 


5 


20001229 






L246915 




A2 


20021009 




EP 2000-98720^ 


3 


20001229 






R: 


AT, 


BE, 


CH, 


DE, 


DK, 


ES, 


FR, 


GB, 


GR, 


IT, 


LI, 


LU, 


NL, 


SE, 


MC, 


PT, 



WO 2002002597 
WO 2002002597 



IE, SI, LT, LV, FI, RO, MK, CY, AL, TR 
A2 20020110 
A3 20020627 



WO 2001-DK459 



20010629 



W: AE, 


AG, 


AL, 


AM, 


AT, 


AU, 


A2, 


BA, 


BB, 


BG, 


BR, 


BY, 


BZ, 


CA, 


CH, 


CN, 


CR, 


CU, 


CZ, 


DE, 


DK, 


DM, 


DZ, 


EE, 


ES, 


FI, 


GB, 


GD, 


GE, 
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BE, 


CH, 


CY, 


DE, 
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FI, 


FR, 


GB, 


GR, 
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PT, 


SE, 


TR, 
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BJ, 


CF, 
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CI, 
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ML, 


MR, 


NE, 


SN, 


TD, 


TG 







US 2003036181 Al 20030220 

EP 1299535 A2 20030409 

R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, 
IE, SI, LT, LV, FI, RO, MK, CY, AL, TR 
PRIORITY APPLN. INFO.: 



US 2001-896896 20010629 
EP 2001-944987 20010629 

LU, NL, SE, MC, PT, 



AB 



DK 


1999- 


1891 


A 


19991230 


DK 


2000- 


865 


A 


20000602 


DK 


2000- 


866 


A 


20000602 


DK 


2000- 


1027 


A 


20000630 


US 


2000- 


174652P 


P 


20000106 


US 


2000- 


210984P 


P 


20000612 


US 


2000- 


211124P 


P 


20000612 


US 


2000- 


217497P 


P 


20000711 


DK 


2000- 


1092 


A 


20000714 


US 


2000- 


225558P 


P 


20000816 


WO 


2000- 


DK743 


W 


20001229 


WO 


2001- 


DK90 


W 


20010209 


WO 


2001- 


DK459 


w 


20010629 


"OUp 


of lysosomal 


enzymes and lysosomal 



enzyme activators, comprising at least one introduced glycosylation site 
as compared to a corresponding parent enzyme or activator. By introducing 
addnl. glycosylation sites the resulting glycosylated lysosomal enzyme or 



Page 29 



Russel 10 073419 



activator obtains improved in vivo activity and thereby provides for 
improved treatment of lysosomal storage diseases. Thus, in particular, 
glycosylation sites (Ser or Thr residues) are introduced into human 
glucocerebrosidase, and esp. in peptide addns . to the N-terminus of 
glucocerebrosidase . A fusion protein was also constructed comprising 
saposin C, a linker peptide, and human glucocerebrosidase . The N-glycan 
structures in glucocerebrosidase and its variants expressed in insect 
cells are characterized. 
IT 350581-18-9 

RL: PRP (Properties) 

(unclaimed sequence; improved lysosomal enzymes and lysosomal enzyme 
activators contg. addnl . glycosylation sites introduced by 
site-specific mutagenesis) 
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2001 : 489670 HCAPLUS 
135: 88016 

Nucleic acids containing single nucleotide 

polymorphisms in the human genome 

Shimkets, Richard A. ; Leach, Martin 

Curagen Corp., USA 

PCT Int. Appl., 484 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



WO 2001048245 A2 20010705 WO 2000~US35346 20001227 

WO 2001048245 A3 20021128 



W: 


AE, 


AG, 


AL, 
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AT, 


AU, 
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GH, 


GM, 


KE, 


LS, 
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MZ, 


SD, 


SL, 


SZ, 


TZ, 


UG, 


ZW, 


AT, 


BE, 


CH, 


CY, 




DE, 


DK, 
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FI, 


FR, 


GB, 


GR, 


IE, 


IT, 


LU, 
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NL, 


PT, 


SE, 


TR, 
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BJ, 
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CG, 
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CM, 
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GN, 


GW, 
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MR, 


NE, 
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TD, 


TG 






.282726 




A2 


20030212 




EP 2000-990358 


20001227 






R: 


AT, 


BE, 


CH, 


DE, 


DK, 


ES, 


FR, 


GB, 


GR, 


IT, 


LI, 


LU, 


NL, 


SE, 


MC, 


PT, 




IE, 


SI, 


LT, 


LV, 


FI, 


RO, 


MK, 


CY, 


AL, 


TR 















PRIORITY APPLN. INFO.: US 1999-472688 A2 19991227 

WO 2000-US35346 W 20001227 

AB The invention provides 651 nucleic acids contg. single-nucleotide 

polymorphisms (SNPs) identified for transcribed human sequences, as wedl 
as methods of using the nucleic acids. The polymorphisms are arranged in 
the order: 422 nucleotide sequences for SNPs that are silent; 58 
nucleotide sequences for SNPs that lead to conservative amino acid 
changes; 139 nucleotide changes for SNPs that lead to nonconservative 
amino acid changes; and 32 nucleotide sequences for SNPs that involve a 
gap. In particular, the polymorphisms are related to angiopoietin, 
4-hydroxybutyrate dehydrogenase, ATP-dependent RNA helicase, MHC Class I 
histocompatibility antigen, or phosphoglycerate kinase. 

IT 345315-58-4 

RL: BOC (Biological occurrence); BSU (Biological study, unclassified); BUU 
(Biological use, unclassified); PRP (Properties); BIOL (Biological study); 
OCCU (Occurrence); USES (Uses) 

(SNP involving nonconservative amino acid change; nucleic acids contg. 

single nucleotide polymorphisms in the human genome) 
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2001:435102 HCAPLUS 
135:56043 

Compleinentary peptide ligands generated from higher 
eukaryote genoine sequences 

Roberts, Gareth Wyn; Heal, Jonathan Richard 

Proteom Limited, UK 

PCT Int. Appl., 488 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



2001042276 


Al 


20010614 




WO 2000-GB4773 


20001213 
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CH, 


CN, 


CR, 


CU, 
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TR, 
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TZ, 
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US, 


UZ, 


VN, 


YU, 
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BY, 
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RU, 


TJ, 
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SZ, 
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ZW, 
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BE, 


CH, 
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DE, 
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ES, 


FI, 


FR, 


GB, 


GR, 


IE, 


IT, 


LU, 
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PT, 


SE, 


TR, 
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BJ, 
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CG, 
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GA, 
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GW, 


ML, 
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NE, 


SN, 
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TG 






1244691 




Al 


20021002 




EP 2000-96 


J1486 


20001213 






R: AT, 


BE, 


CH, 


DE, 


DK, 


ES, 


FR, 


GB, 


GR, 


IT, 


LI, 


LU, 


NL, 


SE, 


MC, 


PT, 


IE, 


SI, 


LT, 


LV, 


FI, 


RO, 


MK, 


CY, 


AL, 


TR 














APPLN. 


INFO. 












GB 1999- 


29471 


A 


19991213 







WO 2000-GB4773 W 20001213 
AB The invention relates to the identification of complementary peptides from 
the anal, of protein and nucleotide sequence databases from higher 
eukaryote genomes excluding human and plants. These specific 

conriplementary peptides interact with their relevant target proteins 
encoded in the eukaryote genome. Specific complementary peptides to the 
proteins encoded in the eukaryote genome can be used as reagents and drugs 
from drug discovery programs and as lead ligands to facilitate drug design 
and development . 
IT 345600-50-2 345600-52-4 
RL: PRP (Properties) 

(unclaimed sequence; complementary peptide ligands generated from 
higher eukaryote genome sequences) 
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2001:338721 HCAPLUS 
134:364015 

Sequences of antigenic proteins of a group B 
Streptococcus and the genes encoding them and their 
uses in vaccination 

Le Page, Richard William Falla; Wells, Jeremy Mark; 

Hanniffy, Sean Bosco 

Microbial Technics Limited, UK 

PCT Int. Appl., 178 pp. 
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Patent 
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WO 2001032882 
WO 2001032882 

W: CA, CN, 
RW: AT, BE, 
PT, SE 
EP 1214417 

R: AT, BE, 
IE, FI, 
JP 2003527100 
US 2003170782 
PRIORITY APPLN. INFO 



A2 

A3 
JP, US 
CH, CY, DE, 



20010510 
20011115 



WO 2000-GB3437 



20000907 



DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 



A2 20020619 
CH, DE, DK, ES, 
CY 

20030916 
20030911 



FR, 



EP 2000-958822 20000907 
GB, GR, IT, LI, LU, NL, SE, MC, PT, 



T2 
Al 



JP 2001-535564 20000907 
US 2002-91007 20020306 
GB 1999-21125 A 19990907 
WO 2000-GB3437 W 20000907 
AB The invention provides protein and DNA sequences of novel protein antigens 
from Streptococcus agalactiae, a group B Streptococcus. Their use in 
vaccines and screening methods is also described. Gene/partial gene 
sequences putatively encoding exported proteins in S. agalactiae have been 
identified using the nuclease screening system vis the LEEP (Lactococcus 
Expression of Exported Proteins) system. Genes contg. signal sequences 
were identified using a nuclease reporter gene. Tru9I restriction digest 
fragments were cloned upstream of the nuclease gene and transf ormants 
screened using a DNA-Toluidine blue agar overlay which allowed colonies 
secreting the nuclease to be detected by formation of a pink halo. Mice 
vaccinated with a no . of the genes showed statistically significant longer 
survival time than did unvaccinated controls when challenged with. S. 
agalactiae. 
IT 339523-85-2 

RL: BSU (Biological study, unclassified); PRP (Properties); THU 
(Therapeutic use); BIOL (Biological study); USES (Uses) 

(amino acid sequence; sequences of antigenic proteins of group B 
Streptococcus and genes encoding them and their uses in vaccination) 
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2001:319937 HCAPLUS 
134 : 336710 

Protein and cDNA sequences of a novel human zinc 
finger-containing protein 56 (ZFP56) and diagnostic 
and therapeutic uses thereof 
Mao, Yumin; Xie, Yi 

Shanghai Bio Road Gene Development Ltd., Peop. Rep. 
China 



PCT Int. Appl. 
CODEN: PIXXD2 

Patent 

Chinese 

1 



KIND DATE 



32 pp. 



APPLICATION NO. DATE 



WO 2001030827 Al 20010503 

W: AE, AG, AL, AM, AT, AU, 

CU, CZ, DE, DK, DM, DZ, 

ID, IL, IN, IS, JP, KE, 

LV, MA, MD, MG, MK, MN, 

SE, SG, SI, SK, SL, TJ, 

ZA, ZW, AM, AZ, BY, KG, 

RW: GH, GM, KE, LS, MW, MZ, 

DE, DK, ES, FI, FR, GB, 

CF, CG, CI, CM, GA, GN, 

CN 1303936 A 20010718 
PRIORITY APPLN. INFO.: 

AB The invention provides protein and 



AZ, 
EE, 
KG, 

MW, 
TM, 
KZ, 
SO, 
GR, 
GW, 



WO 2O0O-CN393 

BA, BB, BG, BR, BY, 

ES, FI, GB, GD, GE, 

KP, KR, KZ, LC, LK, 

MX, MZ, NO, NZ, PL, 

TR, TT, TZ, UA, UG, 
MD, RU, TJ, TM 

SL, SZ, TZ, UG, ZW, 

IE, IT, LU, MC, NL, 

ML, MR, NE, SN, TD, 
CN 1999-119861 

CN 1999-119861 A 
cDNA sequences for a 



20001027 

BZ, CA, CH, CR, 

GH, GM, HR, HU, 

LR, LS, LT, LU; 

PT, RO, RU, SD, 

US, UZ, VN, YU, 

AT, BE, CH, CY, 

PT, SE, BE, BJ, 

TG 

19991027 
19991027 

novel human zinc 
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AB 



IT 



HCAPLUS COPYRIGHT 2003 ACS on STN 
1995: 69165 HCAPLUS 
122:183539 

Probing the steric tolerance of the insect 
hypertrehalosemic hormone receptor to develop an 
effective photoaf f inity probe 
Hayes, T. K. ; Nails, F. L.; Ford, M. M. 
Institute of Biosciences and Technology, Texas A&M 
University, College Station, TX, 77843, USA 
Pept.: Cham., Struct. Biol., Proc. Am. Pept . Syitip., 
13th (1994), Meeting Date 1993, 666-8. Editor (s): 
Hodges, Robert S.; Smith, John A. ESCOM; Leiden, 
Neth. 

CODEN: 60LXAW 
Conference 
English 

Hypertrehalosemic hormone (HTH) stimulates the insect fat body to 
synthesize and release trehalose into the hemolymph. This study involved 
structure-activity studies in the cockroach Blaberus discoidalis to 
develop a photoaf f inity label to study the properties of HTH receptors. 
Single amino acid replacement analogs helped det. that the most crit. side 
chains were pGlul, Phe4, and Trp8 . The ring nature of the pGlu seems 
important and the aromaticity of the Phe and Trp is required. A series of 
analogs where a single amino acid was replaced with its D-enantiomer were 
compared with the bioassay system to gain initial indications where 
important conformational elements might be located or induced for receptor 
interaction. The loss of activity or potency around the arom. amino acids 
confirms the importance of these residues towards the biol. activity of 
HTH. The relatively high potency of HTH analogs that have D- amino acids 
in positions 6 or 7 is a first indicator that a turn centered around the 
Pro-Gly sequence may be important for the biol. activity of the hormone. 
The design of a photoaf f inity label to better study the properties of HTH 
receptors must include a site for radioiodination . However, attempts to 
either incorporate a site for iodination or to iodinate acceptable sites 
resulted in analogs that were inactive or had unacceptably low potencies. 
The aim of addnl . analogs is to attach radioiodination and 
photocrosslinking sites outside the region of HTH that is crit. for 
receptor interaction. The construction of a branched chain analog at 
position 7 in the D-conf iguration is the best candidate to continue this 
approach . 

161389-51-1 161389-55-5 161510-72-1 

RL: BAG (Biological activity or effector, except adverse); BPR {Biological 
process); BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study); PROC (Process) 

(receptor-binding activity; probing the steric tolerance of the insect 
hypertrehalosemic hormone receptor to develop an effective 
photoaf f inity probe) 
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51 HCAPLUS COPYRIGHT 2003 ACS on STN 
1994 : 603095 HCAPLUS 
121:203095 

Hemagglutinating and chemotactic properties of 
synthetic peptide segments of fimbrial protein from 
Porphyromonas gingivalis 
Ogawa, T,; Hamada, S. 

Fac. Dent., Osaka Univ., Suita, Japan 
Infection and Immunity (1994), 62(8), 3305-10 
CODEN: INFIBR; ISSN: 0019-9567 
Journal 
English 

Porphyromonas gingivalis 381 fimbriae, their synthetic peptide segments, 
and lipopolysaccharide (LPS) were examd. for hemagglutinating the 
migration-stimulating activities, P. gingivalis 381 fimbriae clearly 



SOURCE: 



TYPE: 
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caused hemagglutination, and several oligopeptide segments such as FP381 
(61-80), FP381 (171-185), andFP381 (302-321), agglutinated erythrocytes 
although less effectively than the native fimbriae. Furthermore, P, 
gingivalis 381 LPS but not Escherichia coli 055 :B5 LPS definitely 
exhibited hemagglutination. P, gingivalis fimbriae as well as their 
synthetic peptides possessing hemagglutinating activity enhanced the 
chemotaxically induced migration of human peripheral blood monocytes . The 
results of the analyses using synthetic peptide FP381 (61-80), its related 
compds., and an analog suggested that the amino acid sequence XLTXXLTXXNXX 
within fimbrial protein mols . may play an important role structurally in 
the attachment of the protein to host cells such as erythrocytes and . 
monocytes . 
IT 138166-46-8 

RL: BIOL (Biological study) 

(of fimbrial protein of Porphyromonas gingivalis, hemagglutination by) 
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DOCUMENT NUMBER: 119:49901 

TITLE: Synthesis and conformational studies on peptides 

corresponding to a putative autophosphorylation site 
of abl TPK 

AUTHOR(S): Ruzza, Paolo; Calderan, Andrea; Filippi, Bruno; 

Donella-Deana, Arianna; Pinna, Lorenzo A. ; Borin, 

Gianf ranco 

CORPORATE SOURCE: Dep. Org. Chem. , Univ. Padua, Padua, Italy 

SOURCE: International Journal of Peptide & Protein Research 

(1993), 41(3), 291-9 

CODEN: IJPPC3; ISSN: 0367-8377 
DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Octapeptide H-Gly-Asp-Thr-Tyr-Thr-Ala-His-Ala-OH, corresponding to the 
sequence of the main putative autophosphorylation site (Tyr515) of the 
transforming gene of Abelson murine leukemia virus (v-abl) which codes for 
a membrane-assocd. tyrosine-specif ic protein kinase (abl TPK) , as well as 
some of its analogs modified in positions -2, -1, +1 and +3, were prepd. 
by classical soln. methods. Anal, of the v-abl gene has shown that both 
the f ibroblast-transf orming and tyrosine-protein kinase activities reside 
within a minimal region encoding . The synthetic peptides were tested as 
substrates for a protein of 43 kDa (p43v-abl) , which represents the most 
active, isolated form of this enzyme. The kinetic data obtained indicate 
that the phosphorylation rates vary considerably, depending on the 
sequence of the peptide, as expected. As a rule, no significant 
incremental efficiency results from each substitution in the parent 
sequence. While the replacement of the two charged residues (Asp2 and 
His7) with neutral Ala is well tolerated, the substitution with amino 
acids bearing opposite charges is detrimental. The correlation between 
secondary structure of the synthetic octapeptides and their substrate 
recognition by p43v-abl was studied using CD and fluorescence 
spectroscopy. A comparison of the spectroscopic data with the kinetic 
parameters does not confirm a close relationship between the 
conformational properties of these peptides and their enzymic role. 
IT 148693-03-2P 

RL: SPN (Synthetic preparation) ; PREP (Preparation) 

(prepn., conformation, and kinetics of phosphorylation of, as model for 

autophosphorylation site of leukemia virus tyrosine-specif ic protein 

kinase ) 
IT 148693-21-4P 

RL: SPN (Synthetic preparation); PREP (Preparation) 

(prepn., deblocking, and peptide coupling reactions of, in prepn. of 

autophosphorylation site model octapeptide) 
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ACCESSION NUMBER: 1993:166893 HCAPLUS 

DOCUMENT NUMBER; 118:166893 

TITLE: Location of the epitope recognized by monoclonal 

antibody 63G on the primary structure of human 
respiratory syncytial virus G glycoprotein and the 
ability of synthetic peptides containing this epitope 
to induce neutralizing antibodies 
Garcia-Barreno, Blanca; Delgado, Teresa; 
Akerlind-Stopner, Britt; Norrby, Erling; Melero, Jose 
A. 

Cent. Nac. Microbiol., Inst. Salud "Carlos III", 
Madrid, 2S220, Spain 

Journal of General Virology (1992), 73(10), 2625-30 
CODEN: JGVIAY; ISSN: 0022-1317 
Journal 
English 

The location of the epitope recognized by monoclonal antibody (MAb) 63G on 
the primary structure of the human respiratory syncytial virus G 
glycoprotein was detd. by testing the reactivity of synthetic peptides 
with the MAb. The role of individual amino acids in this epitope was 
detd. by using a set of 13-mer peptides contg. single residue deletions. 
Residues 204-209 were essential for antibody binding. Several peptides, 
free or bound to keyhole limpet hemocyanin (KLH) , were used to raise 
antisera in rabbits. The antipeptide antibodies reacted with the G 
protein in Western blots. However, only peptide Gl-KLH (residues 187-200 
bound to KLH) induced antibodies that reacted with the intact G protein 
and inhibited infect ivity. 
146646-03-9 

RL: BIOL (Biological study) 

(of G glycoproteins of human respiratory syncytial virus, monoclonal 
antibody recognition of and neutralizing antibodies induction by) 
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117: 185027 

Synthetic analogs of the carboxyl-terminus of 
. beta . -thyrotropin : the importance of basic amino 
acids in receptor binding activity 
Leinung, Matthew C; Bergert, Elizabeth R 
Daniel J.; Morris, John C. 
Dep. Biochem. Mol. Biol., Mayo Clin. Med. 
Rochester, MN, 55901, USA 
Biochemistry (1992), 31(41), 10094-8 
CODEN: BICHAW; ISSN: 0006-2960 
Journal 
English 

Previously, using a synthetic peptide strategy, it was detd. that four 
distinct regions of human .beta.-TSH (.beta.TSH) were responsible for 
interaction of TSH and TSH receptor. The most potent of these four 
regions was the carboxyl-terminus of the subunit, represented by the 
peptide sequence .beta. 101-112, which inhibited binding of radiolabeled 
.beta.TSH to receptor in radio receptor assay with an IC50 of approx. 100 
.mu.M. In the current studies, the native amino acids in region 
.beta . 101-112 were systemically substituted with alanine, and which 
residues within this span are important to the binding activity of TSH to 
its receptor were detd. Substitution of LyslOl, Asnl03, Tyrl04, Cysl05, 
Lysl07 and LysllO with alanine each caused a significant fall in activity 
as compared to the native sequence, whereas substitution at the remaining 
positions had little or no effect. Because three of these residues are 
pos. charged at physiol. pH, it was hypothesized that this charge may be 
important to the binding activity of the sequence. The charge 
characteristics of the region were modified by synthesizing two series of 
analogs in which the residues identified in the alanine substitution 
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studies were substituted with Arg, D-Lys, and D-Arg at each position. In 
addn., a series of analogs contg. basic residues, either added to or 
substituted for nonbasic residues in the sequence . beta . 101-112 , was 
synthesized. Substitution of Arg, D-Lys, and D-Arg for LyslOl, LyslOV, 
and LysllO had little effect on activity; however, inclusion of addnl . 
basic residues in the , beta . 101-112 sequence significantly enhanced the 
inhibitory activity of the region. Substitution of Ala, Ser, Lys or D-Lys 
for Cysl05 resulted in marked redn. in activity. In order to det . if the 
activity of the region was sequence specific or, rather, due to the amino 
acid compn. of the region, the amino acid residues in an addnl. series of 
peptides were reversed and scrambled. The peptide with reversed sequence 
as well as four peptides with scrambled sequence possessed equal activity 
to the native peptide, suggesting that the amino acid compn. and the net 
charge of the region is more important than its specific sequence. Thus, 
a net pos . charge of region 101-112 of .beta.TSH is an important factor in 
the inhibitory activity of peptides representing this portion of the 
hormone. Enhancement of the charge, by addn. of basic residues, may 
increase the potency of interaction of TSH with its receptor. However, 
pos. charge is not all important, as removal of CyslOS results in marked 
loss in activity even though the resulting peptide may have overall 
significantly greater charge. 
IT 143681-09-8 

RL: PROC (Process) 

(as TSH .beta . -subunit C-terminal region analog, TSH receptor binding- 
of, mol . structure in relation to) 
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sequence inhibit adherence of Porphyromonas gingivalis 

AUTHOR(S): Lee, Jin Yong; Sojar, Hakimuddin T.; Bedi, Gurrinder 

S.; Genco, Robert J. 

CORPORATE SOURCE: Dep. Oral Biol., State Univ. New York, Buffalo, NY, 

14214, USA 

SOURCE: Infection and Immunity (1992), 60(4), 1662-70 

CODEN: INFIBR; ISSN: 0019-9567 
DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Fimbriae are important in the adherence of many bacterial species to the 

surfaces they eventually colonize, Porphyromonas (Bacteroides ) gingivalis 
fimbriae appear to mediate adherence to oral epithelial cells and the 
pellicle-coated tooth surface. The role and contribution of fimbriae "in 
the binding of P. gingivalis to hydroxyapatite (HAP) coated with saliva as 
a model for the pellicle-coated tooth surface were investigated. 
3H-labeled P. gingivalis or the radioiodinated purified fimbriae were 
incubated with 2 mg of HAP beads coated with whole human saliva (sHAP) and 
layered on 100% Percoll to sep. unbound from sHAP-bound components. The 
radioactivity of the washed beads was a measure of the bound components. 
The binding of P. gingivalis 2561 (381) cells and that of purified 
fimbriae were concn. dependent and saturable at approx. 108 cells and 40 
.mu.g of fimbriae added, resp. The addn. of fimbriae inhibited binding of 
P. gingivalis to sHAP beads by 65%, while the 75-kDa protein, which is 
another major surface component of P. gingivalis 2561, did not show 
significant inhibition, suggesting that the fimbriae are important in 
adherence. Encapsulated and sparsely fimbriated P. gingivalis W50 did not 
bind to sHAP beads. On the basis of the predicted sequence of the 
fimbrillin, a structural subunit of fimbriae, a series of peptides was 
synthesized and used to localize the active fimbrillin domains involved in 
P. gingivalis adherence to sHAP beads. Peptides from the 
carboxyl-terminal one-third of the fimbrillin strongly inhibited P. 
gingivalis binding to sHAP beads. Active residues within the sequence of 
inhibitory peptide 226-245 (peptide contg. residues 226 to 245) and 
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peptide 293-306 were identified by using smaller fragments prepd. either 
by trypsin cleavage of the peptide 226-245 or by synthesis of smaller 
segments of peptide 293-306. Hemagglutinin activity, lectinlike binding, 
and ionic interaction did not seem to be involved in this binding since 
lysine, arginine, carbohydrates, and calcium ions failed to affect the 
binding of P. gingivalis. The observation that poly-L-lysine, bovine 
serum albumin, and defatted bovine serum albumin, even at high concns - , 
only partially blocked the binding of P. gingivalis indicates that 
hydrophobic interactions are not the major forces involved in P, 
gingivalis binding to sHAP beads. Protease inhibitors such as EDTA, 
leupeptin, pepstatin, 1, 10-phenanthroline, and phenylmethylsulf onyl 
fluoride did not interfere with the binding of P. gingivalis. However, 
the binding of P. gingivalis to trypsin- or chymotrypsin-pretreated sHAP 
beads was reduced. Overall, these results suggest that fimbrillin has 
domains primarily confined in the carboxyl-terminal region of the protein 
which are responsible for binding P. gingivalis to surface-bound salivary 
components through specific protein-protein interactions. Specific 
f imbrillin-mediated binding may be important in P. gingivalis attachment 
to oral surfaces coated with salivary components. 
IT 141631-38-1 141631-39-2 

RL: BIOL (Biological study) 

(synthetic fimbrillin analog, adherence of Porphyromonas gingivalis 
inhibition by) 
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INVENTOR (S) : Basava, Channa; Hostetler, Karl Y. 

PATENT ASSIGNEE (S) : Vical, Inc., USA 

SOURCE: PCT Int. Appl . , 63 pp. 

CODEN: PIXXD2 
DOCUMENT TYPE: Patent 
LANGUAGE : English 
FAMILY ACC. NUM. COUNT: 3 
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PATENT NO. KIND DATE APPLICATION NO. DATE 



WO 9107978 Al 19910613 WO 1990-US6352 19901031 

W: AU, CA, JP 

RW: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LU, NL, SE 
US 5175146 A 19921229 US 1990-572674 19900824 

CA 2069943 AA 19910606 CA 1990-2069943 19901031 

AU 9067512 Al 19910626 AU 1990-67512 19901031 

EP 504168 Al 19920923 EP 1990-917399 19901031 

R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE 
JP 05504558 T2 19930715 JP 1991-500475 19901031 

PRIORITY APPLN. INFO.: US 1989-446932 19891205 

US 1990-572674 19900824 
WO 1990-US6352 19901031 
OTHER SOURCE(S): MARPAT 116:21468 

AS Y- (Rl-R2-Al-A2-A3-A4-Ser-Thr )mA7-xCT (8-32) [I; Y = a dicarboxylic acid 
residue; Rl = H, H(CH2)nO, H2N(CH2)nO, etc.; R2 - 1-amino-l- 
cyclopropylcarboxylic acid residue, etc.; Al, A7 = Cys, Thr, D-Thr, Tyr, 
etc.; A2 = Gly, Ala, Ser, bond; A3 = Asn, Ser, bond; A4 = Leu, bond; m = 
1, 2; n = 1-22; xCT amino acid sequence corresponding at least at 
residues 9-11, 13-15, 17, 18, 23-26, and 28-32 to murine, salmon, eel,' 
avian, porcine, bovine, ovine, or human calcitonin; provisos apply] were 
prepd. by the solid phase method. E.g., [ (Ser-Thr) 2-Lys-7] -avian 
calcitonin (7-32) was prepd. by the solid phase method starting from 
methylbenzhydrylamine resin-bound B0C-Pro-NH2 using the appropriate 
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protected amino acids. This at 1 .mu.g/kg s.c. effected 2.09 mg Ca 
decrease per dL blood in rats. 
IT 138038-10-5P 

RL: SPN (Synthetic preparation); PREP (Preparation) 
(prepn. of, as hypocalcemic) 
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Immunobiological activities of synthetic peptide 
segments of fimbrial protein from Porphyromonas 

gingivalis 

Ogawa, Tomohiko; Kusumoto, Yutaka; Uchida, Hiroshi; 
Nagashima, Shigeru; Ogo, Hideji; Hamada, Shigeyuki 
Fac. Dent., Osaka Univ., Suita-Osaka, 565, Japan 
Biochemical and Biophysical Research Communications 
(1991), 180(3), 1335^41 
CODEN: BBRCA9; ISSN: 0006-291X 
Journal 
English 

Several oligopeptide segments of fimbrial subunit protein (fimbrilin) of 
P. gingivalis strain 381 were synthesized and tested for immunobiol. 
activities. Peptides F3 (31-50; amino acid residue nos . 31-50, based on 
the amino acid sequence of the fimbrilin proposed by Dickinson et al . , 
(1988), F12 (212-231) and F17 (312-331) were found to be immunodominant 
epitopes of this fimbrial protein as revealed by ELISA. Furthermore, 
peptides F5 (71-90) and F17 (312-331) could agglutinate rabbit erythrocytes, 
and were mitogenic for BALB/c spleen cells but not thymocytes. These 
peptides enhanced the no. of f imbria-specif ic antibody-secreting cells in 
BALB/c spleen cell cultures, and induced cytokines such as tumor necrosis 
factor- . alpha . and interleukin-6 prodn. in human monocyte/macrophage 
cultures. Evidently these defined peptide segments are responsible for 
the immunostimulating portions within the fimbrial protein mol . 
138166-46-8 

RL: BIOL (Biological study) 

(of fimbrial protein, of Porphyromonas gingivalis, immunobiol. of) 
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Somatostatin analogs 

Strachan, Robert G. ; Paleveda, William J.; Veber, 
Daniel F. ; Holly, Frederick W. 
Merck and Co., Inc., USA 
U.S. , 10 pp. 

CODEN: USXXAM 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 4162248 
PRIORITY APPLN. 



A 



19790724 



INFO. 



[I; 



US 1978-894266 19780407 

US 1976-732692 19761014 
US 1977-781610 19770328 
AB Somatostatin analogs cyclo (X j -Xlk-X21-Phe-Phe-X3-Lys-Thr-Phe~Thr-X4m) 
X = HN(CH2)nC0 (n - 0-4 ) ; XI - Lys, Lys (INOC) (INOC = 

isonicotinyloxycarbonyl) ; X2 - Asn, Ala, HNCHEtCO; X3 = Trp, D-Trp; X4 = 
Ser, Gly; j, k, 1, and m = 0, 1 except that j-m .noteq. all 0 and all 1; 
the peptide backbone ring must contain 24-33 atoms] and their 
pharmaceutically acceptable salts, which inhibit the release of growth 
hormone, insulin, glucagon, and gastric secretions, were prepd. by 
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solid-phase methods . Thus, R-D-Trp-Lys (INOC) -Thr (CH2Ph) -Phe-Thr (CH2Ph) - 
Asn-Phe-Phe-O-resin {II, R = Me3C02C (BOC) ] was prepd. by stepwise 
solid-phase couplings, BOC-deblocked by CF3C02H to give II (R = H) , and 
then resin-cleaved by H2NNH2 to give H-D-Trp-Lys { INOC) -Thr (CH2Ph) -Phe- 
Thr (CH2Ph) -Asn-Phe-Phe-NHNH2 (III). Ill was converted to the azide, which 
was cyclized to give cyclo [Asn-Phe-Phe-D-Trp-Lys (INOC) -Thr (CH2Ph) -Phe- 
Thr(CH2Ph)], which was deblocked by HF to give cyclo [Asn-Phe-Phe-D-Trp~ 
Lys (INOC) -Thr-Phe-Thr] , which was reduced with Zn/HOAc to give 
cyclo (Asn-Phe-Phe-D-Trp-Lys-Thr-Phe-Thr) , I can be used to treat 
acromegaly, gastric ulcers, and diabetes (no data) . 
IT 71645-27-7P 

RL: SPN (Synthetic preparation) ; PREP (Preparation) 
(prepn. of) 
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f inger-contg . protein 56 (ZFP56), which is a novel member of Kruppel 
protein family. The ZFP56 shares sequence homol. with human zinc 
f inger-contg . protein 131 (2FP131). The invention also relates to 
constructs and methods to express the cloned gene for the prepn. of its 
protein and antibodies using E.coli cells or eukaryotic cells,, and 
diagnostic and therapeutic uses for ZFP56 related diseases. 
337966-36-6 
RL: PRP (Properties) 

(unclaimed sequence; protein and cDNA sequences of a novel human zinc 
f inger-contg . protein 56 (ZFP56) and diagnostic and therapeutic uses 
thereof ) 
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Mathematical modeling of insect neuropeptide 
potencies. Are quantitatively predictive models 
possible? 

Lee, M. J./ de Jong, S,/ Gade, G. ; Poulos, C; 
Goldsworthy, G. J. 

Biotechnology, Unilever Research Vlaardingen, 
Vlaardingen, 3133 AT, Neth, 

Insect Biochemistry and Molecular Biology (2000), 
30(10), 899-907 

CODEN: IBMBES; ISSN: 0965-1748 
Elsevier Science Ltd. 
Journal 
English 

The potencies of natural adipokinetic hormones and synthetic variants have 
been detd. in Locusta migratoria using the lipid mobilization assay in 

vivo, and/or the acetate uptake assay in vitro. These data are 
combinations of previously published and unpublished data (a total of 
sixty-nine analogs), and form data sets for the construction of math, 
models of the hormone potencies. The sequence variations of amino acids 
in both natural and artificial adipokinetic hormone analogs were described 
using continuous descriptor scales zl', 22', and 23', each previously 
published scale being derived from various properties of the amino acids. 
By means of these z' -scales and partial least squares regression we 
attempted to model the potencies in Locusta migratoria of adipokinetic 
hormones in the two assays. Correlations (r2 values) between predicted 
and actual potencies of the different peptides were up to 0.73. We 
discuss the potential of the partial least squares method for formulating 
quant, relationships between different hormone structures and their 
potencies, and describe how the procedure might be used in 
structure-activity prediction with the construction of an optimized 
peptide data set. 
304868-61-9 

RL: BAC (Biological activity or effector, except adverse) / BSU (Biological 
study, unclassified); PRP (Properties); BIOL (Biological study) 

(math, models for QSAR of adipokinetic hormone analogs in Locusta 

migratoria) 

25 THERE ARE 25 CITED REFERENCES AVAILABLE FOR THIS 
RECORD, ALL CITATIONS AVAILABLE IN THE RE FORMAT 



IT 



REFERENCE COUNT: 



L3 ANSWER 33 OF 51 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



INVENTOR (S) : 



HCAPLUS COPYRIGHT 2003 ACS on STN 
2000: 1614 23 HCAPLUS 
132:204851 

Human ubiquitin ligases substrate-targeting subunits 
and their cDNA sequences and uses as therapeutic 
targets 

Chiaur, Dah Shiarn; Pagano, Michele; Latres, Esther 
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AB The present invention relates to the discovery, identification, and 
characterization of human cDNA sequences that encode 26 novel 
substrate-targeting subunits of ubiquitin ligases . The invention 
encompasses nucleic acids encoding the novel substrate-targeting subunits 
of ubiquitin ligases, transgenic mice, knock-out mice, host cell 
expression systems, and proteins encoded by the nucleic acids of the 
present invention. The ubiquitin ligase subunits (FBPs) each contain an F 
box motif through which they interact with the other components of the 
ubiquitin ligase complex. In addn., some of these FBPs contain WD--40 
domains and LRRs (which appear to be involved in their interaction with 
substrates), while other FBPs contain potential potential protein-protein 
interaction modules not yet identified in FBPs, such as leucine zippers, 
ring fingers, helix-loop-helix motifs, proline-rich motifs, and SH2 
domains. The invention is also based, in part, on the discovery and 
identification of FBP-specific substrates p27 and . beta . -catenin and on 
methods to identify novel FBP substrates. Some of the genes encoding the 
novel F box proteins were also mapped to chromosome sites frequently 
altered in various malignant disorders. The present invention relates to 
screening assays that use the novel substrate-targeting subunits to 
identify potential therapeutic agents such as small mols., compds . or 
derivs. and analogs of the novel ubiquitin ligases which modulate activity 
of the novel ubiquitin ligases for the treatment of proliferative and 
dif f erentiative disorders, such as cancer, major opportunistic infections, 
immune disorders, certain cardiovascular diseases, and inflammatory 
disorders. The invention further encompasses therapeutic protocols and 
pharmaceutical compns . designed to target ubiquitin ligases and their 
substrates for the treatment of proliferative disorders. 

IT 260388-05-4 

RL: PRP (Properties) 

(unclaimed sequence; human ubiquitin ligases substrate-targeting 
subunits and their cDNA sequences and uses as therapeutic targets) 
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1998:734986 HCAPLUS 
130 : 10630 

Factor Vila inhibitors from Kunit z-domain proteins 
Dennis, Mark S.; Lazarus, Robert A, 
Genentech Inc., USA 

U.S., 45 pp.. Cent .-in-part of U.S. Ser. No. 206,310, 

CODEN: USXXAM 

Patent 

English 

4 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 5834244 
US 5795954 
CA 2184058 

PRIORITY APPLN. 

OTHER SOURCE (S) 

AB A potent s 
Factor XIa 
provided i 
inhibition 
indicated . 

IT 171281-94- 
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(Process ) ; 
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REFERENCE COUNT 



A 19981110 US 1995-398010 19950303 

A 19980818 US 1994-206310 19940304 

AA 19950908 CA 1995-2184058 19950303 

INFO,: US 1994-206310 19940304 

MARPAT 130:10 630 
erine protease inhibitor capable of inhibiting Factor Vila, 

plasma kailikrein, or plasmin is provided. The inhibitor is 
n a pharmaceutical compn. for treatment of diseases where 
of Factor Vila, Factor XIa, plasma kailikrein, or plasmin is 

iological occurrence); BSU (Biological study, unclassified); PEP 

engineering or chemical process) ; PRP (Properties) ; THU 
ic use); BIOL (Biological study); OCCU (Occurrence); PROC 
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Vila inhibitors from Kunit z-domain proteins) 
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Patent 
English 
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PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



US 5786328 A 19980728 US 1995-463432 19950605 

PRIORITY APPLN. INFO.: US 1995-463432 19950605 

OTHER SOURCE(S): MARPAT 129:156937 

AB Potent and specific serine protease inhibitors are provided that are 

capable of inhibiting plasma kailikrein. The inhibitors are provided in 
pharmaceutical compn s . for the treatment of diseases and disorders where 
inhibition of plasma kailikrein in indicated. 

IT 185557-76-0 

RL: BAG (Biological activity or effector, except adverse); BSU (Biological 
study, unclassified); PRP (Properties); THU (Therapeutic use); BIOL 
(Biological study); USES (Uses) 
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1998: 163759 HCAPLUS 
128:228247 

Tumor-associated proteins for development of 

iimtiunoassays for detecting cervical cancer 

Keesee, Susan K. ; Obar, Robert; Wu, Ying-Jye 

Matritech, Inc., USA 

PCT Int. Appl., 7 9 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



WO 9809170 

WO 9809170 
W: AU, 
RW: AT, 

US 5858683 

AU 9740732 

EP 923740 

R: AT, 
IE, 

JP 2001500609 
US 6027905 
US 2003157482 
PRIORITY APPLN. INFO 



A2 19980305 
A3 19980423 

CA, JP 

BE, CH, DE, DK, ES, 
A 19990112 
Al 19980319 
A2 19990623 
CH, DE, DK, ES, 



BE, 
FI 



FR, 



APPLICATION NO. DATE 



WO 1997-US14526 19970819 



FR, GB, GR, IE, IT, LU, MC, NL, PT, SE 

US 1996-705660 19960830 

AU 1997-40732 19970819 

EP 1997-938400 19970819 
GB, GR, IT, LI, LU, NL, SE, MC, PT, 



T2 

A 

Al 



20010116 
20000222 
20030821 



US 
WO 
US 



JP 1998-511706 
US 1997-989045 
US 1999-315355 

1996- 705660 A 

1997- US14526 W 
1997-989045 A3 



19970819 
19971211 
19990517 

19960830 
19970819 
19971211 



AB The invention provides a wide range of methods and compns . for detecting 
and treating cervical cancer in an individual. Specifically, the 
invention provides target cervical cancer-assocd. proteins, which permit a 
rapid detection, preferably before metastases occur, of cervical cancer. 
The target cervical cancer-assocd. protein, may be detected, for example, 
by reacting the sample with a labeled binding moiety, for example, a 
labeled antibody capable of binding specifically to the protein. The 
invention also provides kits useful in the detection of cervical cancer in 
an individual. In addn., the invention provides methods utilizing the 
cervical cancer-assocd. proteins either as targets for treating cervical 
cancer or as indicators for monitoring of the efficacy of such a 
treatment . 

IT 204440-33-5 

RL: ANT (Analyte) ; BOC (Biological occurrence); BSU (Biological study, 
unclassified); PRP (Properties);. THU (Therapeutic use); ANST (Analytical 
study); BIOL (Biological study); OCCU (Occurrence); USES (Uses) 

(amino acid sequence; tumor-assocd. proteins for development of 

immunoassays for detecting cervical cancer) 
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Fusion proteins of a tissue factor and a Kunitz 
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F. 
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DOCUMENT TYPE: 
LANGUAGE : 
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PCT Int. Appl, 

CODEN: PIXXD2 

Patent 

English 

1 



57 pp. 



PATENT NO. 



KIND DATE 



WO 



9720939 
W: AL, 
DK, 



AM, 
EE, 
LR, 
RU, 
BY, 
LS, 
IT, 
NE, 



Al 19970612 
AT, AU, AZ, BA, 



LK, 

RO, 

AZ, 
RW: KE, 

IE, 

MR, 
US 5736364 
AU 9710234 
EP 866870 
EP 866870 

R: AT, BE, 

IE, FI 
JP 2000503528 
AT 234359 
US 5874407 
PRIORITY APPLN. INFO. 



ES, 

LS, 

SD, 

KG, 

MW, 

LU, 

SN, 
A 
Al 
Al 
Bl 



FI, 
LT, 
SE, 
KZ, 
SD, 
MC, 
TD, 



GE, 
LV, 
SI, 
RU, 



GB, 
LU, 
SG, 
MD, 
SZ, 
NL, 
TG 

19980407 
19970627 
19980930 

20030312 



BB, 
HU, 
MD, 
SK, 
TJ, 



APPLICATION NO. DATE 

WO 1996-US18756 19961122 
BG, BR, BY, CA, CH, CN, CU, CZ, 
IS, JP, KE, KG, KP, KR, KZ , 
MK, MN, MW, MX, NO, NZ, PL, 
TM, TR, TT, UA, UG, UZ, VN, 



UG, AT, 
PT, SE, 



IL, 
MG, 
TJ, 
TM 
BE, 



CH, 



BF, BJ, 



DE, DK, 
CF, CG, 



US 1995-566800 
AU 1997-10234 
EP 1996-940592 



ES, FI, FR, GB, 
CI, CM, GA, GN, 

19951204 
19961122 
19961122 



DE, 

LC, 
PT, 
AM, 

GR, 
ML, 



CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, PT, 



T2 

E 

A 



20000328 
20030315 
19990223 



US 
US 



JP 1997-521312 
AT 1996-940592 
US 1997-932589 
1995-566459 A 
1995-566800 A 



19961122 
19961122 
19970917 
19951201 
19951204 
19961122 



WO 1996-US18756 W 

AB Fusion proteins of Kunitz proteinase inhibitors and tissue factor that are 
useful as inhibitors of factor VII activation and of factor Vila-dependent 
processes in the extrinsic pathway of blood coagulation are described for 
therapeutic use. Inhibitor domains from a broad array of Kunitz 
inhibitors can be used. A series of fusion proteins of tissue factor and 
Kunitz domains were prepd. by expression of the cloned gene in Escherichia 
coli. The proteins retained the structural integrity of the tissue factor 
domain, as shown by monoclonal antibody binding, and trypsin inhibition 
activity. The fusion proteins tested also inhibited factor Vila-dependent 
activation of factor IX and several significantly prolonged clotting in an 
activated partial thromboplastin time assay. 

IT 171281-94-OD, fusion with factor VII 

RL: BAC (Biological activity or effector, except adverse); BSU (Biological 
study, unclassified); PRP (Properties); THU (Therapeutic use); BIOL 
(Biological study); USES' (Uses) 

(fusion proteins of tissue factor and Kunitz inhibitor for inhibition 
of factor Vila and extrinsic pathway) 
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Antagonistic effect of synthetic peptides 
corresponding to the binding regions within fimbrial 
subunit protein from Porphyromonas gingivalis to human 
gingival fibroblasts 

Ogawa, Tomohiko; Ogo, Hideji; Kinoshita, Akihiro 

Department of Oral Microbiology, Osaka University 

Faculty of Dentistry, Osaka, 565, Japan 

Vaccine (1997), 15(2), 230-236 

CODEN: VACCDE; ISSN: 0264-410X 
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Journal 
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AB Specific binding region within fimbrial subunit protein (fimbrilin) from 

Porphyromonas gingivalis strain 381 was studied in cultured human gingival 
fibroblasts. Fluorescent micrographs visualized FITC-labeled fimbriae of 
P. gingivalis specifically bound to normal human fibroblast cell line 
(Gin-1) along the cell surface. Flow cytometric anal, also revealed the 
binding of FITC-labeled fimbriae to Gin~l cells. Synthetic peptides 
composed of residues 1-20 (AFGVGDDESKVAKLTVMVYN) of the fimbrilin from P. 
gingivalis, FP381 (1-20), FP381 (69-80; ALTTELTAENQE ) and FP381 (171-181; 
DA-NYLTGSLTT) definitely inhibited P. gingivalis f imbria-binding to Gin-1 
cells by ELISA» Furthermore, based on the Scatchard plot anal, of the 
binding of 1251-labeled P. gingivalis fimbriae to Gin-1 cells, the 
apparent dissocn. const. (Kd) was calcd. as 15.9 pM, and the no. of 
binding sites (Rt) was estd. as 150 sites/cell. Binding studies of 
1251-labeled FP381 (171-181) also revealed the presence of a 
non-interacting, single class of affinity binding sites: the apparent Kd 
and Rt were 29.2 nM and 18440 sites/cell on Gin-1 cells, resp. These 
results demonstrate that specific binding regions on P. gingivalis 
fimbriae to human gingival fibroblasts are present, and certain 
corresponding peptides clearly inhibited the binding of P. gingivalis 
fimbriae to human gingival fibroblasts. 

IT 138166-46-8P 

RL: BPR (Biological process); BSU (Biological study, unclassified); SPN 
(Synthetic preparation); BIOL (Biological study); PREP (Preparation); PROC 
(Process ) 

(antagonistic effect of synthetic peptides corresponding to the binding 
regions within fimbrial subunit protein from Porphyromonas gingivalis 
to human gingival fibroblasts) 
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Kunitz-type plasma kallikrein inhibitors comprising 
non-native protein domains, pharmaceutical uses, and 

expression vectors 

Dennis, Mark S.; Lazarus, Robert A. 

Genentech, Inc., USA 

POT Int. Appl. , 70 pp. 
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Patent 
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PATENT NO. 
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OTHER SOURCE (S) 



US 1995-463155 A 19950605 
WO 1996-US9059 W 19960604 
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AB Potent and specific serine protease inhibitors are provided that are 
capable of inhibiting plasma kallikrein. The inhibitors comprise a 
non-native Kunitz type serine protease inhibitor domain. The inhibitors 
are provided in pharmaceutical compns . for the treatment of diseases and 
disorders where inhibition of plasma kallikrein is indicated. Expression 
vectors are also included. 

IT 185557-76-ODP, peptide derivs . 

RL: BPN (Biosynthetic preparation); FRF (Properties); THU (Therapeutic 
use); BIOL (Biological study); PREP (Preparation); USES (Uses) 

(Kunitz-type plasma kallikrein inhibitors comprising non-native protein 
domains, pharmaceutical uses, and expression vectors) 
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Fimbrial polypeptides useful in the prevention of - 
periodontitis 

Evans, Richard T.; Bedi, Gurrinder S.; Genco, Robert 

J.; Sojar, Hakimuddin T. 

State University of New York, USA 

U.S. , 23 pp. 

CODEN : USXXAM 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



US 5536497 A 19960716 US 1992-994277 19921221 

PRIORITY APPLN. INFO.: US 1992-994277 19921221 

AB Polypeptides related to fimbriae of Porphyromonas gingivalis are described 
and claimed which exhibit inhibition of bacterial adhesion to 
saliva-coated surfaces. The polypeptides are selected from the group 
consisting of fimbriae, fimbrillin, and f imbrial-related . peptides derived 
therefrom. The polypeptides are used as active ingredients in various 
oral formulations designed to prevent adhesion of P. gingivalis to host 
mucosal surfaces and thus interfering with the development of 
periodontitis. The polypeptides are also used in subunit vaccine 
formulations for use against pathogenic, fimbriated P. gingivalis in the 
prophylactic treatment of periodontitis. Use of the polypeptides for 
inducing protective immunity in serum and gingival crevicular fluid may 
prevent primary infection with P. gingivalis as well as the spread of the 
organism between intraoral reservoirs. 
IT 141631-38-lP 

RL: BAC (Biological activity or effector, except adverse) ; BSU (Biological 
study, unclassified) ; PNU (Preparation, unclassified) ; PRP (Properties) ; 
THU (Therapeutic use); BIOL (Biological study); PREP (Preparation); USES 
(Uses) 

(fimbrial polypeptides useful in the prevention of periodontitis) 
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Kunitz domain inhibitor proteins derived from 

Alzheimer's amyloid . beta . -protein precursor inhibitor 

Dennis, Mark S.; Lazarus, Robert A. 

Genentech, Inc., USA 

PCT Int. Appl., 82 pp. 

CODEN: PIXXD2 

Patent 

English 
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PATENT NO. 
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APPLICATION NO. 



DATE 



WO 9523860 
WO 9523860 
W: OA, 
RW: AT, 
US 5795954 
CA 2184058 
EP 748380 

R: AT, 
JP 09509838 
PRIORITY APPLN. 



A2 19950908 
A3 19950928 

JP, MX 

BE, CH, DE, DK, ES, 
A 19980818 
AA 19950908 
Al 19961218 
CH, DE, DK, ES, 
T2 19971007 



WO I995-US2637 



19950303 



BE, 



FR, GB, GR, IE, IT, LU, MC, NL, PT, SE 
US 1994-206310 19940304 
CA 1995-2184058 19950303 
EP 1995-913532 19950303 
FR, GB, GR, IE, IT, LI, LU, MC, NL, PT, SE 
JP 1995-523045 19950303 
INFO.: US 1994-206310 19940304 

WO 1995-US2637 19950303 
AB A potent serine protease inhibitor capable of inhibiting Factor Vila, 

Factor XIa, plasma kallikrein, or plasmin derived from Alzheimer's Amyloid 
beta-Protein Precursor Inhibitor (APPI) is provided. The inhibitor is 
provided in a pharmaceutical compn. for treatment of diseases where 
inhibition of Factor Vila, Factor XIa, plasma kallikrein, or plasmin is 
indicated. 
IT 171281-94-0 

RL: BPR (Biological process); BSU (Biological study, unclassified); THU 
(Therapeutic use) ; BIOL (Biological study) ; PROC (Process) ; USES (Uses) 

(Kunitz domain inhibitor proteins derived from Alzheimer's amyloid 

.beta . -protein precursor inhibitor) 
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1995:260464 HCAPLUS 
122:53472 

The potential protective immune responses to synthetic 
peptides containing conserved epitopes of 
Porphyromonas gingivalis fimbrial protein 
Ogawa, T. 

Faculty of Dentistry, Osaka University, Suita-Osaka, 

565, Japan 

Journal of Medical Microbiology (1994), 41(5), 349-58 
CODEN; JMMIAV; ISSN; 0022-2615 
Journal 
English 

The immunodominant and T-cell epitopes within the fimbrial subunit protein 
(fimbrillin) of P. gingivalis strain 381 were analyzed by multi-pin 
peptide synthesis technol. Six regions with immunodominant epitopes 
within a sequence of 337 amino acids that reacted with the serum of 
patients with adult periodontitis were detected. T cells from mice 
immunized with P. gingivalis fimbriae exhibited proliferative responses to 
P. gingivalis fimbriae or to six 10-mer synthetic peptides from the amino 
acid sequence of the fimbrillin. Three synthetic peptides that contained 
the regions responsible for the immunodominant epitopes as well as those 
which coincided with a T-cell epitope of P. gingivalis fimbrial mols.: 
FP381 (142-161) , FP381(202-221) , and FP381( 216-243) , were selected and 
synthesized. When guinea pigs were immunized with fimbriae or one of the 
3 synthetic peptide segments and an adjuvant in Freund's incomplete 
adjuvant, enhanced prodn. of the antigen-specific IgG antibodies was 
induced in the serum of the animals. Furthermore, of the 3 synthetic 
peptides tested, FP381 ( 202-221 ) produced the greatest protective immune 
response in guinea pigs infected with P. gingivalis and this was more " 
effective than the native fimbrial protein. 
159928-16-2 
RL: PRP (Properties) 

(in epitope mapping on Porphyromonas gingivalis fimbrial protein 
fimbrillin) 
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Crossover limits have been increased. See HELP CROSSOVER for details. 

Experimental and calculated property data are now available. See HELP 
PROPERTIES for more information. See STNote 27, Searching Properties 
in the CAS Registry File, for complete details: 
http : //www . cas . org/ONLINE/STN/STNOTES/stnotes27 . pdf 

-> d .seq 12 tot 

L2 ANSWER 1 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 549494-95-3 REGISTRY 

CN L-Cysteine, L- cyst einyl-L- cyst einyl-L-pheny lal any 1-L-threonyl -L- 

phenylalanyl-L-seryl-L-seryl-L-lysyl-L-lysyl-L~isoleucyl-L-seryl-L-leucyl- 
L-glutaminyl-L-arginyl-L-leucyl-L-lysyl-L-seryl-L-tyrosyl-L-valyl-L- 
isoleucyl-L-threonyl-L~threonyl-L-seryl-L-arginyl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 21: PN: WO03051921 FIGURE: 9 unclaimed sequence 
SQL 25 
SQL 25 

SEQ 1 CCFTFSSKKI SLQRLKSYVI TTSRC 



HITS AT: 4-8 

REFERENCE 1: 139:67789 

L2 ANSWER 2 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 521939-37-7 REGISTRY 

CN L-Lysine, L-leucyl-L-isoleucyl-L~cysteinyl-L- . alpha . -glutamyl-L~alanyl-L- 
threonyl-L-asparaginyl-L-phenyialanyl-L-seryl-L-prolyl- ( 9CI ) (CA INDEX 
NAME) 

OTHER NAMES: 

CN 123: PN: WO03038444 SEQID: 236 unclaimed sequence 
SQL 11 
SQL 11 

SEQ 1 LICEATNFSP K 

HITS AT: 7-11 

REFERENCE 1: 138:362621 

L2 ANSWER 3 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 494863-53-5 REGISTRY 

CN L-Lysine , L- . alpha . -glutamyl-L- . alpha . -glutamylglycyl-L-asparaginyl-L- 

tyrosyl-L-threonyl-L-prolyl~ (9CI) (CA INDEX NAME.) 
OTHER NAMES: 

CN 20: PN: WO03010186 SEQID: 20 claimed protein 
SQL 8 
SQL 8 

SEQ 1 EEGNYTPK 



HITS AT: 4-8 

REFERENCE 1: 138:149946 

L2 ANSWER 4 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 474899-11-1 REGISTRY 

CN L-Serine, L-valyl-L-prolyl~L-tyrosyl-L-threonyl-L-phenylalanyl-L-seryl-L- 

seryl-L-lysyl- (9CI) (CA INDEX NAME) 
SQL 9 
SQL 9 
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SEQ 1 VPYTFSSKS 

HITS AT: 4-8 
REFERENCE 1 : 137 : 380984 
REFERENCE 2: 137:380982 

L2 ANSWER 5 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 474842-20-1 REGISTRY 

ON L-Asparagine, L-threonyl-L-phenylalanyl-L-seryl-L-seryl-L-lysyl-L-seryl-L- 

prolylglycyl- (9CI) (CA INDEX NAME) 
SQL 9 
SQL 9 

SEQ 1 TFSSKSPGN 

HITS AT: 1-5 

REFERENCE 1: 137:380980 

L2 ANSWER 6 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473791-54-7 REGISTRY 

CN L-Asparagine, L-tyrosyl-L-threonyl-L-phenylalanyl-L-seryl-L-seryl-L-lysyl- 

L-seryl-L-prolylglycyl- (9CI) {CA INDEX NAME) 
SQL 10 
SQL 10 

SEQ 1 YTFSSKSPGN 

HITS AT: 2-6 

REFERENCE 1: 137:321378 

L2 ANSWER 7 OF '59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473790-80-6 REGISTRY 

CN L-Lysine, L-glutaminyl-L-glutaminyl-L-valyl-L-prolyl-L-tyrosyl-L-threonyl- 

L-phenylalanyl-L-seryl-L-seryl- (9CI) (CA INDEX NAME) 
SQL 10 
SQL 10 

SEQ 1 QQVPYTFSSK 

HITS AT: 6-10 



REFERENCE 1 
REFERENCE 2 
REFERENCE 3 



137:380979 
137:380977 
137:321378 



L2 ANSWER 8 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473469-67-9 REGISTRY 

CN L-Serine, L-gl utaminyl-L-val yl~L-prolyl-L~ tyros yl-L-threonyl -L- 
phenylalanyl-L-seryl-L-seryl-L-lysyl-L-seryl-L-prolylglycyl-L- 
asparaginylglycyl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 QVPYTFSSKS PGNGS 

HITS AT: 5-9 
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REFERENCE 1: 137:32137 6 

L2 ANSWER 9 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473469-46-4 REGISTRY 

CN L-Alanine, L-threonyl-L-phenylalanyl-L-seryl-L-seryl-L-lysyl-L-seryl-L- 
prolylglycyl-L-asparaginylglycyl-L-seryl-L-leucyl-L-arginyl-L~ . alpha . - 
glutamyl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 TFSSKSPGNG SLREA 



HITS AT: 1-5 

REFERENCE 1: 137:32137 6 

L2 ANSWER 10 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473469-21-5 REGISTRY 

CN L-Proline, L-tyrosyl~L-leucyl-L- . alpha . -aspartyl-L-glutaminyl-L-glutaminyl- 
L-valyl-L-prolyl-L-tyrosyl-L-threonyl~L-phenylalanyl-L-seryl-L-seryl-L- 
lysyl-L-seryl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 YLDQQVPYTF SSKSP 



HITS AT: 9-13 

REFERENCE 1: 137:321376 

L2 ANSWER 11 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473469-07-7 REGISTRY 

CN Glycine, L-glutaminyl-L-glutaminyl-L-valyl-L-prolyl-L-tyrosyl-L-threonyl-L- 
phenylalanyl-L-seryl~L-seryl-L-lysyl-L-seryl-L-prolylglycyl-L-asparaginyl- 
{9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 QQVPYTFSSK SPGNG 



HITS AT: 6-10 

REFERENCE 1: 137:32137 6 

L2 ANSWER 12 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473469-06-6 REGISTRY 

CN L-Serine, glycyl-L-tyrosyl-L-leucyl-L- . alpha . -aspartyl-L-glutaminyl-L- 

glutaminyl-L-valyl-L-prolyl-L-tyrosyl-L-threonyl-L-phenylalanyl-L-seryl-L- 
seryl-L-lysyl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 GYLDQQVPYT FSSKS 

HITS AT: 10-14 
REFERENCE 1: 137:321376 

L2 ANSWER 13 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473468-80-3 REGISTRY 

CN L-Arginine, L-prolyl-L-tyrosyl-L-threonyl~L-phenylalanyl-L-seryl~L-ser,yl-L- 
lysyl-L-seryl-L-prolylglycyl-L-asparaginylglycyl-L-seryl-L-leucyl- ( 9CI ) 
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(CA INDEX NAME) 
SQL 15 
SQL 15 

SEQ 1 PYTFSSKSPG NGSLR 

HITS AT: 3-7 

REFERENCE 1: 137:321376 

L2 ANSWER 14 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473468-36-9 REGISTRY 

ON L-Glutamic acid, L-tyrosyl-L-threonyl-L-phenylalanyl-L-seryl-L-seryl-L- 
lysyl-L-seryl-L-prolylglycyl-L-asparaginylglycyl-L-seryl-L-leucyl-L- 
arginyl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 YTFSSKSPGN GSLRE 



HITS AT: 2-6 

REFERENCE 1: 137:321376 

L2 ANSWER 15 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 4734 68-22-3 REGISTRY 

CN L-Asparagine, L- . alpha . -aspartyl-L-glutaminyl-L-glutaminyl-L-valyl-L- 

prolyl-L-tyrosyl-L-threonyl~L-phenylalanyl-L-seryl-L-seryl-L-lysyl-L-seryl- 
L~prolylglycyl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 DQQVPYTFSS KSPGN 

HITS AT: 7-11 

REFERENCE 1: 137:32137 6 

L2 ANSWER 16 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473468-12-1 REGISTRY 

CN L-Lysine, L-alanylglycyl-L-tyrosyl-L-leucyl-L- . alpha . -aspartyl-L- 
glutaminyl-L-glutaminyl-L-valyl'-L-prolyl-L- tyros yl-L-threonyl-L- 
phenylalanyl-L-seryl-L-seryl- (9CI) (CA INDEX NAME) 

SQL 15 

SQL 15 

SEQ 1 AGYLDQQVPY TFSSK 



HITS AT: 11-15 
REFERENCE 1 : 137 : 321376 

L2 ANSWER 17 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 473328-98-2 REGISTRY 

CN L-Lysine, L~gl utaminyl-L-val yl-L-prolyl-L- tyros yl-L-threonyl -L- 

phenylalanyl-L-seryl-L-seryl- {9CI) (CA INDEX NAME) 
SQL 9 
SQL 9 

SEQ 1 QVPYTFSSK 



HITS AT: 5-9 
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REFERENCE 
REFERENCE 
REFERENCE 



137 : 380978 
137:380976 
137 : 334071 



L2 ANSWER 18 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 460039-30-9 REGISTRY 

CN L-Valine, L~ ♦ alpha . -glutamyl-L-isoleucyl-L-tryptophyl-L-threonyl-L- 
phenylalanyl-L-seryl-L-threonyl-L-lysyl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 5: PN: WO02072766 SEQID; 5 claimed sequence 
SQL 9 
SQL 9 

SEQ 1 EIWTFSTKV 

HITS AT: 4-8 

REFERENCE 1: 137:24 652 9 

L2 ANSWER 19 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 408305-90-8 REGISTRY 

CN L-Glut amine, L-glutaminyl-L-seryl-L-lysyl-L-threonyl-L-phenylalanyl-L- 
threonyl~L-alanyl-L~lysyl-L-seryl-L- . alpha . -aspartyl~L-phenylalanyl-L- 
seryl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 27: PN: WO0226987 SEQID: 27 unclaimed sequence 
SQL 13 
SQL 13 

SEQ 1 QSKTFTAKSD FSQ 

HITS AT: 4-8 



REFERENCE 



136: 290007 



L2 ANSWER 20 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 387817-80-3 REGISTRY 

CN L-Leucine, L-alanyl-L-valyl-L-valyl-L-asparaginyl-L-valyl-L-threonyl-L- 

tyrosyl-L-seryl-L-seryl-L-lysyl-L- , alpha . -aspartyl-L-glutaminyl-L-alanyl- 
arginyl-L-glut aminyl-L-alanyl-L-leucyl-L- . alpha . -aspartyl-L-lysyl- ( 9CI ) 

(CA INDEX NAME) 



OTHER NAMES: 



CN 

CN 

CN 

CN 

CN 

CN 

SQL 

SQL 

SEQ 



399: 

403: 

405: 

405: 

406: 

54 : 

20 

20 



PN: 
PN: 
PN: 
PN: 
PN: 
PN: 



US20020115139 SEQID: 
US20020052329 SEQID: 
US20030138438 SEQID: 



405 claimed protein 
405 claimed sequence 
4 05 unclaimed protein 



WO0200174 SEQID: 405 unclaimed sequence 
US20020147143 SEQID: 405 unclaimed sequence 
US20030064947 SEQID: 405 claimed sequence 



1 AVVNVTYSSK DQARQALDKL 



HITS AT: 6-10 
REFERENCE 1 
REFERENCE 2 
REFERENCE 3 



139: 132437 
138:302631 
137:290681 
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REFERENCE 4: 137:196735 
REFERENCE 5; 136:354185 
REFERENCE 6: 136:101081 

L2 ANSWER 21 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 375806-38-5 REGISTRY 

ON L-Tryptophan, L-threonyl-L-tyrosyl-L-threonyl-L-prolyl-L-lysyl-L-lysyl-L- 
seryl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 2842: PN: WO0131019 PAGE: 660 claimed protein 
CN 810: PN: WO0131019 PAGE: 365 claimed sequence 
SQL 8 
SQL 8 

SEQ 1 TYTPKKSW 



HITS AT: 1-5 

REFERENCE 1: 136:117375 

REFERENCE 2: 136:4714 

L2 ANSWER 22 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350581-18-9 REGISTRY 

CN L-Threonine, L-alanyl-L-phenylalanyl-L-asparaginyl-L-tyrosyl-L-threonyl-L- 

seryl-L-lysyl-L-asparaginyl-L-seryl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 77: PN: WO014 9830 PAGE: 62 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 AFNYTSKNST 



HITS AT: 3-7 

REFERENCE 1: 135:118780 

L2 ANSWER 23 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350473-71-1 REGISTRY 

CN L-Lysine, L-alanyl-L-phenylalanyl-L-phenylalanyl-L- . alpha . -aspartyl-L~ 
lysyl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 112: PN: WO0149834 SEQID: 283 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 AFFDKTYTAK 



HITS AT: 6-10 

REFERENCE 1 : 135 : 106297 

L2 ANSWER 2 4 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350473-16-4 REGISTRY 

CN L-Threonine, L-lysyl-L-threonyl-L-phenylalanyl-L-threonyl-L~alanyl-L-lysyl- 

L-. alpha. -glutamyl-L-alanyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 40: PN: WO0149834 SEQID: 187 unclaimed sequence 
SQL 10 
SQL 10 
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SEQ 1 KTFTAKEAAT 



HITS AT: 2-6 

REFERENCE 1: 135:106297 

L2 ANSWER 25 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350473-13-1 REGISTRY 

CN L-Alanine, L-lysyl-L-threonyl-L-tyrosyl-L~threonyl-L-alanyl-L-lysyl-L- 

. alpha. -glutamyl-L-alanyl-L-alanyl- (9CI) (OA INDEX NAME) 
OTHER NAMES: 

CN 37: PN: WO0149834 SEQID: 184 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 KTYTAKEAAA 



HITS AT: 2-6 

REFERENCE 1: 135:106297 

L2 ANSWER 26 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 350473-12-0 REGISTRY 

CN L-Threonine, L- lysyl~L-thr eonyl-L- tyros yl-L-threonyl -L-alanyl-L-lysyl-i- 

alanyl-L-alanyl~L~alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES : 

CN 36: PN: WO0149834 SEQID: 183 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 KTYTAKAAAT 

HITS AT: 2-6 

REFERENCE 1: 135:106297 

L2 ANSWER 27 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350473-07-3 REGISTRY 

CN L-Threonine, L~alanyl~L-threonyl-L-tyrosyl-L-threonyl~L-alanyl-L-lysyl-L- 

. alpha. -glutamyl-L-alanyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 31: PN: WO0149834 SEQID: 178 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 ATYTAKEAAT 



HITS AT: 2-6 

REFERENCE 1: 135:106297 

L2 ANSWER 28 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350473-05-1 REGISTRY 

CN L-Lysine, L~seryl-L- . alpha . -glutamyl-L-phenylalanyl-L- . alpha . -aspartyl-L- 
lysyl-L-threonyl-L-phenylalanyl-L-threonyl-L-alanyl- ( 9CI ) (CA INDEX 

NAME) 
OTHER NAMES: 

CN 28: PN: WO0149834 SEQID: 176 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SEFDKTFTAK 
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HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 29 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350472-98-9 REGISTRY 

ON L-Lysine, L-seryl-L- . alpha . -glutamyl-L-phenylalanyl-L- . alpha . -aspartyl~L 

alanyl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl- (901) (CA INDEX NAME) 
OTHER NAMES: 

CN 21: PN: WO0149834 SEQID: 169 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SEFDATYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 30 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350472-97-8 REGISTRY 

CN L-Lysine, L-seryl-L- . alpha . -glutamyl-L-phenylalanyl-L-alanyl-L-lysyl-L- 

threonyl-L-tyrosyl-L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 20: PN: WO0149834 SEQID: 168 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SEFAKTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 31 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350472-96-7 REGISTRY 

CN L~Lysine, L-seryl-L- . alpha . -glutamyl-L-alanyl-L- . alpha . ~aspartyl-L-lysyl 

threonyl-L-tyrosyl-L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 19: PN: WO0149834 SEQID: 167 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SEADKTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 32 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 350472-95-6 REGISTRY 

CN L-Lysine, L~seryl-L-alanyl-L-phenylalanyl-L- . alpha . -aspartyl-L-lysyl-L- 

threonyl-L-tyrosyl~L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 18: PN: WO0149834 SEQID: 166 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SAFDKTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 
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L2 ANSWER 33 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 350472-94-5 REGISTRY 

CN L-Lysine, L-alanyl-L~ . alpha . -glutamyl-L-phenylalanyl~L- . alpha . -aspartyl- 

lysyl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 17: PN: WO0149834 SEQID: 165 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 AEFDKTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 3 4 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350258-06-9 REGISTRY 

CN L-Threonine, L-lysyl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl-L-lysyl-L- 

. alpha. -glutamyl-L-alanyl-L-alanyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 126: PN: WO0149834 SEQID: 77 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 KTYTAKEAAT 



HITS AT: 2-6 

REFERENCE 1: 135:106297 

L2 ANSWER 35 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350258-05-8 REGISTRY 

CN L~Lysine, L-seryl-L- . alpha . -glutamyl-L-phenylalanyl-L- . alpha . -aspartyl-L 

lysyl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl- {9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 125: PN: WO0149834 SEQID: 65 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SEFDKTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:106297 

L2 ANSWER 36 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 350229-19-5 REGISTRY 

CN L-Lysine , L-cysteinyl-L-isoleucyl-L-phenylalanyl-L- . alpha . ~aspartyl-L- 
seryl-L-threonyl-L-tyrosyl-L-threonyl-L-alanyl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 37: PN: WO0149834 SEQID: 284 claimed sequence 
SQL 10 
SQL 10 

SEQ 1 CIFDSTYTAK 



HITS AT: 6-10 

REFERENCE 1: 135:1062 97 

L2 ANSWER 37 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 345600-52-4 REGISTRY 
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CN L- Lysine, L-phenylalanyl-L-seryl-L-seryl-L-seryl-L~prolyl-L-threonyl-L- 

tyrosyl-L-seryl-L-prolyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1470: PN: WO0142276 SEQID: 1487 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 FSSSPTYSPK 

HITS AT: 6-10 

REFERENCE 1: 135:5 604 3 

L2 ANSWER 38 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 345600-50-2 REGISTRY 

CN L-Asparagine, L-seryl-L-seryl-L-seryl-L-prolyl-L-threonyl-L-tyrosyl-L- 

seryl-L-prolyl-L-lysyl- (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 1468: PN : WO0142276 SEQID: 1485 unclaimed sequence 
SQL 10 
SQL 10 

SEQ 1 SSSPTYSPKN 

HITS AT: 5-9 

REFERENCE 1: 135:5 6043 

L2 ANSWER 39 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 345315-58-4 REGISTRY 

CN L-Serine, L-asparaginyl-L-phenylalanyl-L-threonyl-L-seryl-L-lysyl-L- 

tyrosyl-L-histidyl-L-methionyl-L-lysyl-L-valyl-L-leucyl-L-tyrosyl-L-leucyl- 
{9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 221: PN: WO014 8245 ' SEQID : 750 claimed protein 
SQL 14 
SQL 14 

SEQ 1 NFTSKYHMKV LYLS 

HITS AT: 1-5 

REFERENCE 1: 135:88016 

L2 ANSWER 40 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 339523-85-2 REGISTRY 

CN L-Valine, L-methionyl-L-prolyl-L-leucyl-L-threonyl-L-alanyl-L~leucyl-L- 

. alpha . -glutamyl-L-isoleucyl-L-lysyl-L- . alpha . -aspartyl-L-lysyl-L-threonyl- 
L-phenylalanyl-L-seryl-L-seryl-L-lysyl-L-phenylalanyl-L-arginylglycyl-L- 
tyrosyl-L-seryl-L- . alpha . -glutamyl-L- . alpha . -glutamyl-L- . alpha . -glutamyl- 
(9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 138: PN: WO0132882 FIGURE: 1 claimed sequence 
SQL 25 
SQL 25 

SEQ 1 MPLTALEIKD KTFSSKFRGY SEEEV 

HITS AT: 12-16 
REFERENCE 1: 134:364015 

L2 ANSWER 41 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
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RN 337 966-3 6-6 REGISTRY 

CN L-Leucine, L-methionyl~L-alanyl-L-phenylalanyl-L-arginyl-L-hist idyl-L~ 

leucyl~L-isoleucyl-L-- , alpha . -glutamyl-L-phenylalanyl-L-threonyl-L-tyrosyl- 
L-threonyl-L-alanyl-L-lysyl- (9CI) (CA INDEX NAME) 

OTHER NAMES: 

CN 7: PN: WO0130827 SEQID: 7 unclaimed sequence 
SQL 15 
SQL 15 



SEQ 



HITS AT: 



1 MAFRHLIEFT YTAKL 



REFERENCE 



10-14 



1: 134:336710 



L2 ANSWER 42 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 304868-61-9 REGISTRY 

CN L~Threoninamide/ 5~oxo-L-prolyl-L-leucyl-L-asparaginyl-L-phenylalanyl-L- 

threonyl-L-prolyl-L-lysyl-L-tryptophylglycyl- (9CI) (CA INDEX NAME) 
NTE modified 



type 


location 


description 


terminal mod, 
uncommon 


Thr-10 
Glp-1 


C- terminal amide 



SQL 10 
SQL 10 

SEQ 1 XLNFTPKWGT 

HITS AT: 3-7 



REFERENCE 



133:347529 



L2 ANSWER 43 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 260388-05-4 REGISTRY 

CN L-Glutamine, L-cysteinyl-L- . alpha . -glutamyl -L-seryl-L-seryl-L-phenylalanyl - 
L-seryl-L-leucyl-L-asparaginyl-L-methionyl-L-asparaginyl-L-phenylalanyl-L- 
seryl-L-seryl-L-lysyl-L-arginyl-L-threonyl-L-lysyl-L-phenylalanyl-L-lysyl- 
L-isoleucyl-L-threonyl-L-threonyl-L-seryl-L-methionyl- ( 9CI ) (CA INDEX 

NAME) 
OTHER NAMES: 

CN 83: PN: WO02055665 SEQID: 
CN 87: PN: WO0012679 SEQID: 
SQL 25 
SQL 25 



83 unclaimed sequence 
83 unclaimed sequence 



SEQ 1 CESSFSLNMN FSSKRTKFKI TTSMQ 

HITS AT: 10-14 
REFERENCE 1: 137:106028 
REFERENCE 2: 132:204851 

L2 ANSWER 44 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 204440-33-5 REGISTRY 

CN L-Lysine, L-threonyl-L-tyrosyl-L-seryl-L-alanyl- (9CI) (CA INDEX NAME) 
SQL 5 
SQL 5 



SEQ 



1 TYSAK 
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HITS AT: 



1-5 



REFERENCE 



128:228247 



L2 
RN 
CN 



SQL 
SQL 



ANSWER 4 5 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
185557-7 6-0 REGISTRY 

Glycine, L-tyrosyl-L-serylglycyl-L-cysteinylglycylglycyl~L-asparaginyl-L- 

. alpha . -glutamyl-L-asparaginyl-L-asparaginyl-L-phenylalanyl-L-threonyl-L- 

seryl-L-lysyl-L-glutaminyl-L- . alpha . -glutaniyl-L-cysteinyl-L-leucyl-L- 

arginyl-L-alanyl-L-cysteinyl-L-lysyl-L-lysyl- (9CI) (CA INDEX NAME) 

24 

24 



SEQ 1 YSGCGGNENN FTSKQECLRA CKKG 

HITS AT: 10-14 



REFERENCE 



1: 129:156937 



REFERENCE 



2: 126:84615 



L2 ANSWER 4 6 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 171281-94-0 REGISTRY 

CN Glycine^ glycyl-L-asparaginyl-L- . alpha . -glutamyl-L-asparaginyl-L- 

asparaginyl-L-phenylalanyl-L-threonyl-L-seryl~L~lysyl~L-glutaminyl-L- 
. alpha . -glutaiuyl-L-cysteinyl-L-leucyl-L-arginyl-L~alanyl-L-cysteinyl-L- 
lysyl-L-lysyl- (9CI) (CA INDEX NAME) 

SQL 19 

SQL 19 

SEQ 1 GNENNFTSKQ ECLRACKKG 

HITS AT: 5-9 



REFERENCE 
REFERENCE 
REFERENCE 



130: 10630 

127:104342 

124:781 



L2 ANSWER 47 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 161510-72-1 REGISTRY 

CN Hypertrehalosemic hormone (Blabera discoidalis) , 7- [N6- [N- [N- (N-acetyl~L- 
seryl) -L-seryl] -L-seryl] -D-lysine] - (9CI) (CA INDEX NAME) 

NTE multichain 
modified 



SQL 13,10,3 
SQL 13,10,3 



type 






description 


terminal 


mod. 


Thr-10 


C-terminal amide 


terminal 


mod. 


Ser-1' 


N-acetyl 


bridge 




Lys-7 - Ser-3' 


amide bridge 


uncommon 




Glp-1 




stereo 




Lys-7 


D 



SEQ 1 XVNFSPKWGT 

HITS AT: 3-7 
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**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 122:183539 

L2 ANSWER 48 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 161389-55-5 REGISTRY 

CN Hypertrehalosemic hormone (Blabera discoidalis) , 7- [N6- [N- [N- (N-acetyl- 
seryl) -L-seryl] -L-seryl] -L-lysine] - (9CI) {CA INDEX NAME) 

NTE multichain 
modified 



type 




location 


description 


terminal 


mod . 


Thr-lO 


C-terminal amide 


terminal 


mod. 


Ser-1' 


N-acetyl 


bridge 




Lys-7 - Ser-3' 


amide bridge 


uncommon 




Glp-1 





SQL 13,10,3 
SQL 13,10,3 



SEQ 1 XVNFSPKWGT 



HITS AT: 3-7 



**RELATED SEQUENCES AVAILABLE WITH SEQLINK** 
REFERENCE 1: 122:183539 

L2 ANSWER 4 9 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 161389-51-1 REGISTRY 

CN Hypertrehalosemic hormone (Blabera discoidalis ) , 7- [N6~ [ 3- [ 4-hydroxy-3- 

{ iodo-1251) phenyl] -1-oxopropyl] -L-lysine] - (9CI) (CA INDEX NAME) 
NTE modified 



type 


location 


description 


terminal mod. 


Thr-10 


C-terminal amide 


uncommon 


Glp-1 




modification 


Lys-7 


undetermined modification 



SQL 10 
SQL 10 



SEQ 1 XVNFSPKWGT 



HITS AT: 3-7 

REFERENCE 1: 122:183539 

L2 ANSWER 50 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 

RN 159928-16-2 REGISTRY 

CN L-Glutamic acid, N- [N- [N2- [N2- [N2- [1- [N- [N- (N2-L-seryl-L-asparaginyl ) -L 
tyrosyl] -L-threonyl] -L-prolyl] -L-lysyl] -L-asparaginyl] -L-lysyl] -L- 
isoleucyl]- (9CI) (CA INDEX NAME) 

SQL 10 

SQL 10 

SEQ 1 SNYTPKNKIE 



HITS AT: 2-6 
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REFERENCE 



122:53472 



L2 
RN 
CN 



NTE 



ANSWER 51 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 

14 8 693-21-4 REGISTRY 

L-Alanine, N- [N6- [ (1, 1-dimethylethoxy ) carbonyl] -N2- [N- [O- (1, 1- 

dimethylethyl) -N- [O- (1, 1-dirtiethylethyl ) -N- [0- (1, 1-dimethylethyl ) -N- 

[ (phenylmethoxy) carbonyl] -L-threonyl] -L~tyrosyl] -L-threonyl] -L-alanyl] -L- 

lysyl]-, 1, l-dimethylethyl ester (9CI) (CA INDEX NAME) 

modified 



type 



location 



description 



modification Thr-1 

modification Thr-1 

modification Tyr-2 

modification Thr-3 

modification Lys-S 



(phenylmethoxy) carbonyl<Z> 

Ir l-dimethylethyl<t-Bu> 

1, l-dimethylethyl<t-Bu> 

1, l-dimethylethyl<t-Bu> 

(1, 1-dimethylethoxy) carbonyl<Boc> 



SQL 6 
SQL 6 

SEQ 1 TYTAKA 

HITS AT: 1-5 



REFERENCE 



119: 49901 



L2 ANSWER 52 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 148693-03-2 REGISTRY 

CN L-Alanine, N- [N2- [N- [N- [N- [N- (N2-glycyl-L-lysyl ) -L-threonyl] -L-tyrosyl] -L- 

threonyl] -L-alanyl] -L~lysyl] - ( 9CI ) (CA INDEX NAME) 
SQL 8 
SQL 8 

SEQ 1 GKTYTAKA 

HITS AT: 3-7 



REFERENCE 



119:49901 



L2 ANSWER 53 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 146646-03-9 REGISTRY 

CN L-Lysine, N2- [N2- [N- [N- [N- [N- [1- [N2- [N2- [N- ( 1-L-lysyl-L-prolyl ) -L- 

threonyl] -L-lysyl] -L-lysyl] -L-prolyl] -L-threonyl] -L-phenylalanyl] -L- 
threonyl] -L-threonyl] -L-lysyl]~ (9CI) (CA INDEX NAME) 

SQL 12 

SQL 12 

SEQ 1 KPTKKPTFTT KK 

HITS AT: 7-11 



REFERENCE 



118 : 166893 



L2 
RN 
CN 



SQL 
SQL 



ANSWER 54 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
143681-09-8 REGISTRY 

L-Tyrosine, N- [N- [N2- [N2- [1- [N2- [N- [N- [N- [N2- (N-L-lysyl-L-threonyl ) -L- 

asparaginyl] -L-tyrosyl] -L-seryl] -L-threonyl] -L-lysyl] -L-prolyl] -L- 

glutaminyl] -L-lysyl] -L-seryl] - (9CI) (CA INDEX NAME) 

12 

12 
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SEQ 1 KTNYSTKPQK SY 

HITS AT: 3-7 

REFERENCE 1: 117:185027 

L2 ANSWER 55 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 141631-39-2 REGISTRY 

CN L-Lysine, L~tyrosyl-L- . alpha . -aspartyl-L-seryl-L-asparaginyl-L-tyrosyl-L- 
threonyl-L-prolyl-L-lysyl-L~asparaginyl-L-lysyl-L-isoleucyl-L- . alpha . - 
glutamyl-L-arginyl-L-asparaginyl-L-histidyl-L-lysyl-L-tyrosyl-L- . alpha. - 
aspartyl-L~isoleucyl- ( 9CI) (CA INDEX NAME) 

SQL 20 

SQL 20 

SEQ 1 YDSNYTPKNK lERNHKYDIK 

HITS AT: 4-8 

REFERENCE 1: 116:252016 

L2 ANSWER 56 OF 5 9 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 141631-38-1 REGISTRY 

CN L-Lysine, L-tyrosyl-L-prolyl-L-valyl-L-leucyl~L-valyl-L-asparaginyi-L- 

phenyla lanyl-L-a spar aginyl-L-ser yl~L-asparaginy 1-L-a spar aginyl-L- tyro syl-L- 
threonyl-L-tyrosyl-L- . alpha . -aspartyl-L~seryl-L-asparaginyl-L-tyrosyl-L- 
threonyl-L-prolyl- {9CI) (CA INDEX NAME) 

SQL 21 

SQL 21 

SEQ 1 YPVLVNFNSN NYTYDSNYTP K 

HITS AT: 17-21 
REFERENCE 1: 125:151136 
REFERENCE 2: 116:252016 

L2 ANSWER 57 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 138166-46-8 REGISTRY 

CN L-Isoleucine, L-asparaginyl-L-tyrosyl-L-threonyl-L-prolyl-L-lysyl-L- 

asparaginyl-L-lysyl-L-isoleucyl-L- . alpha . -glutamyl-L-arginyl-L-asparaginyl- 
L-histidyl-L-lysyl-L~tyrosyl-L- . alpha , -aspartyl-L-isoleucyl-L-lysyl-L- 
leucyl-L-threonyl- (9CI) {CA INDEX NAME) 

SQL 20 

SQL 20 

SEQ 1 NYTPKNKIER NHKYDIKLTI 

HITS AT: 1-5 



REFERENCE 1 
REFERENCE 2 
REFERENCE 3 



126:250187 
121 :203095 
116: 19422 



L2 ANSWER 58 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 138038-10-5 REGISTRY 

CN Somatostatin (sheep reduced), 3-L-glutamic acid-14-L-lysine-, cyclic 

(3. fwdarw. 14) -peptide (9CI) (CA INDEX NAME) 
OTHER CA INDEX NAMES: 

CN 1,4,7,10, 13, 16, 19, 22, 25, 28, 31, 37-Dodecaazacyclodotetracontane, 
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somatostatin deriv, 

NTE 



type 


location 


description 


bridge 


Glu-3 - Lys-14 


lactam 


SQL 14 
SQL 14 






SEQ 1 


AGEKNFFWKT FTSK 




HITS AT: 


10'-14 




REFERENCE 


1: 116:21468 





L2 ANSWER 59 OF 59 REGISTRY COPYRIGHT 2003 ACS on STN 
RN 71645-27-7 REGISTRY 

CN Somatostatin (sheep reduced) , l-de-L-alanine-2-deglycine-3-de-L-cysteine- 

14-de-L-cysteine-, cyclic ( 13 . fwdarw. 4 ) -peptide (9CI) (CA INDEX NAME) 
OTHER CA INDEX NAMES: 

CN 1, 4, 7 , 10, 13, 16, 19, 22, 25, 28-Decaazacyclotriacontane, cyclic peptide deriv. 
NTE cyclic 
SQL 10 
SQL 10 

SEQ 1 NFFWKTFTSK 

HITS AT: 6-10 
REFERENCE 1: 91:211830 
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